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NON-PATHOGENIC STRAINS OF HTV-1 

5 The present invention relates to non-pathogenic strains of HTV-1 and to components, 
parts, fragments and derivatives thereof and to genetic sequences derived therefrom and 
their use in the development of diagnostic and therapeutic compositions for the treatment 
and prophylaxis of AIDS and AIDS-related disorders. The present invention also relates 
to a method for attenuating pathogenic strains of HIV-1 by mutagenizing particular 
10 regions of the HIV-1 genome. 

Bibliographic details of the publications referred to in this specification are collected at 
the end of the description. Sequence Identity Numbers (SEQ ID NOs.) for the 
nucleotide and amino add sequences referred to in the specification are defined 
IS following the bibliography. 

Throughout this specification, unless the context requires otherwise, the word 
"comprise", or variations such as "comprises 1 " or "comprising", will be understood to 
imply the inclusion of a stated element or integer or group of elements or integers but 
20 not the exclusion of any other element or integer or group of elements or integers. 

Genomic nucleotide sequences of HIV-1 strains referred to herein are represented by 
their corresponding DNA sequence. 

25 Exemplary viral isolates referred to herein as "CIS" and "C98" were deposited at the 
PHLS Centre for Applied Microbiology and Research, European Collection of Animal 
Cell Cultures (ECACC), Division of Biologies, Porton Down, Salisbury, Wiltshire SP4 
OJG. CI 8 was deposited on 17 October, 1994 under Provisional Accession Number 
V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession 

30 Number V941031169. Viral isolate "C54" was deposited at ECACC on 14 February, 
1995 under Provisional Accession Number ' 
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A summary of particular deletion mutants of HTV- 1 of the present invention referred to 
herein is given in Figure 11. 

Acquired Immune Deficiency Syndrome (AIDS) and AIDS related disorders are the 
5 clinical result of infection by Human Immunodeficiency Virus type I (HTV-1) (Barre- 
Sinoussi et al, 1983). Infection by HTV-1 is generally characterised by progressive 
immune system damage (Teeuwsen et of, 1990; Clerici et al, 1989) leading to 
opportunistic infections, malignancies or wasting syndrome that constitute clinically- 
defined AIDS (Busch et at, 1991; Klaslow et al, 1990). 

10 

The high mortality rate of individuals infected with HIV-1 together with the social and 
economic consequences of the continuing HIV-1 epidemic has created an urgent need 
for a safe and effective treatment and/or prophylaxis against the devastating effects of 
AIDS. However, despite over a decade of high level scientific research into the 

15 pathogenesis of HTV-1 and the clinical manifestations of the disease, together with a 
detailed molecular analysis of the virus, there has been little success in the development 
of an effective vaccine. To date, the most effective therapy is treatment with zidovudine 
(AZT) which delays the onset of fell blown AIDS and alleviates to some extent the 
symptoms of HIV-1 infection. However, AZT is not an innocuous compound and AZT, 

20 metabolic products thereof or impurities therein can cause a number of side effects 
which limit long term treatment with the drug. Furthermore, AZT resistant isolates have 
been reported during treatment Clearly, therefore, a need exists to develop alternative 
strategies in preventing and treating HTV-1 infection. 

25 The initial phases of HIV-1 infection are summarised by Levy (1993) as involving 
attachment, fusion and nucleocapsid entry. These phases have been the traditional foci 
in research into development of antiviral strategies. The molecular events at the virus 
genomic level have also been the subj ect of intense scientific research with an aim being 
the development of a live attenuated vaccine as a possible approach for the treatment or 

30 prophylaxis of HTV-1 infection. 
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There is a high variable rate of progression from initial HTV-1 infection to AIDS which 
reflects a rapidly chang in g pathogen and variable immune response of the host to 
infection (Sheppard et al, 1993). With regards to the latter, HIV-1 can be considered 
5 as a heterogenous group of viruses differing at the genetic level with concomitant 
variable pathogenicity. For example, HIV-1 strains can differ in their capacity to kill 
cells. Furthermore, it appears that HTV-1 strains evolve ina host after infection and that 
the evolution varies depending on the tissues infected by the virus. The major sites in 
the genome apparently responsible for biological and pathological variation are the 
10 highly variable envelope region (Cheng-Mayor et al, 1991; Shioda et al, 1992; Hwang, 
et al 1991 ; Sullivan et al, 1993; Groenink et al, 1993) and the viral regulatory regions 
such as tat (Leguern et al, 1993). The genetic complexity of the HTV-1 group of viruses 
together with their variable pathogenicity, are major difficulties in the development of 
live vaccines, genetic vaccines or component vaccines. 

15 

Notwithstanding the highly pathogenic nature of HIV-1, there are some reports of long 
term survival of subjects infected with the virus (Leannont et al, 1992; Levy, 1993; 
Sheppard et al, 1993; Lifson et al 1991). It is not always clear, however, whether a 
benign course following HIV-1 infection is due to host factors, viral factors or other 
20 unknown factors. There are reports that most infected people have at least laboratory 
evidence of progressive immune system damage in the form of CD4+ cell loss (Lang 
et al, 1989) and defective immune responses (Clerici et al, 1989). 

Although simian monkeys have been used as an in vivo model for HIV and Simian 
25 Immunodeficiency Virus (SIV) infection, a major handicap in AIDS research has been 
the absence of suitable in vivo models to study the pathogenesis of the disease and, in 
particular, to study the viruses involved in benign infection. The need for a suitable in 
vfvo model is heightened by the fact that results obtained in vitro cannot necessarily be 
extrapolated to what occurs in vivo. This was clearly observed by Mosier et of (1993) 
30 where conflicting results were obtained in animals compared to cell cultures. 
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Despite the absence of suitable in vivo models, considerable scientific research has been 
directed to attenuating HTV-1 strains by mutagenesis of the virus genome. Deletions in 
the nef gone have been implicated in attenuated strains of SIV and their use in providing 
5 protective effects in monkeys (Daniel et al, 1992). . However, there are conflicting 
reports on the possible negative influence the nef gene product has on the rate or extent 
of virus repUcation (Tewilliger etal, 1986; Luriwe/o/>1987; Niedennan et al, 1989; 
Kim et a!, 1989; Hammes et al f 1989). In fact, Kim et al (1989) found that nef did not 
affect HTV-1 replication or HTV-1 long terminal repeat (LTR^driven CAT expression. 

10 Kestler JEetal (1991) found that the nef gene is required for full pathogenic potential 
in SIV. However, such is the complexity of the HTV-1 group of viruses and the 
variability of immune responses between individuals let alone different species, that it 
is far from clear whether nef deleted strains of HTV-1 would behave similarly to fief 
deleted strains of STV-L There is a need, therefore, in order to investigate the possibility 

15 of nef deleted HTV-1 strain as a vaccine candidate, to identify individuals infected with 
such modified viruses. 

Leannont et al (1992) reported that a cohort of five persons infected with blood products 
from a single HTV-1 infected donor have remained asymptomatic from up to about 10- 

20 14 years after infection. Subsequently, a sixth person has been identified as being part 
of the cohort Both the donor and recipients were HTV-1 seropositive but with no 
indications of clinical symptoms of HTV-1 related disease and GD4+ cell number and 
(^-microglobulin levels have remained in the normal range. The identification of this 
cohort of benignly infected individuals provides a unique in vivo model in which the 

25 pathogenesis of HTV-1 infection can be studied at the clinical and molecular biological 
levels. 

However, it has not always possible using conventional isolation procedures to routinely 
and reproducibly isolate viral strains from the above mentioned donor or recipients 
30 which has frustrated efforts to investigate the cause of the asymptomatic individuals. 
In accordance with the present invention, methods have now been established to isolate 
viruses from the above individuals. It has been determined, in accordance with the 
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present invention, that the six individuals of the cohort are infected by non-pathogenic 
strains of HIV- 1. Furthermore, the non-pathogenic strains of HIV- 1 cany one or more 
nucleotide mutations. The non-pathogenic strains of the present invention enable the 
generation of a range of therapeutic, diagnostic and targeting agents against HTV-1 
5 infection. The present invention also enables the attenuation of previously pathogenic 
strains of HTV-L 

Accordingly, one aspect of the present invention contemplates a non-pathogenic isolate 
of HIV- 1 or a component, part, fragment or derivative thereof 

10 

In a related embodiment, there is provided a novel isolate of HTV-1 or a component, 
part, fragment or derivative thereof wherein said HTV-1 isolate is capable of stimulating 
in a human or primate subject an immune response such as a humoral immune response 
to at least one HTV-1 glycoprotein such as but not limited to gp41-45, gp!20 and/or 
15 gpl60 while not substantially reducing in said human or primate subject proliferative 
responses and cytokine production to a mitogen, alloantigen and/or recall antigen 
compared to a healthy, non-infected human or primate subject Preferably, the cytokine 
isIL-2. Preferably, the mitogen is ConA or PHA and the recall antigen is influenza or 
tetanus toxoid Preferably, the HTV-1 isolate is non-pathogenic. 

20 

More particularly, the present invention relates to an isolated HTV-1 strain which: 

(i) is substantially non-pathogenic in human subjects; and 

(ii) carries one or more mutations in its genome resulting in the inability to direct 
synthesis of at least one pathogenic HIV-l-derived polypeptide or protein. 

25 

Even more particularly, the present invention provides an isolated HIV-1 strain which: 

(i) is substantially non-pathogenic in human subjects; and 

(ii) carries a mutation in the ire/gene and/or a long terminal repeat (LTR) region or 
in a functionally equivalent location in the HTV-1 genome. 
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Still even more particularly, the present invention is directed to an isolated virus which: 
(i) has a genome which is capable of hybridising under medium stringency 
conditions to complementary nucleic acid from a pathogenic strain of HIV- 1; 
5 (ii) is substantially non-pathogenic in human subjects; 

(iii) carries one or more deletion mutations in a region of its genome corresponding 
to a nef gene in said pathogenic strain of HTVM; and 

(iv) optionally carries a mutation in one or both LTR regions. 

10 In a related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
conditions to complementary nucleic acid from a pathogenic strain of MV-1; 

(ii) is substantially non-pathogenic in human subjects; 

.(iii) carries one or more deletion mutations in an LTR region of its genome; and 
15 (iv) optionally carries a mutation in a region corresponding to a nef gene in said 
pathogenic strain of HIV-L 

In a further related embodiment, there is provided an isolated virus which: 

(i) has a genome which is capable of hybridising under medium stringency 
20 conditions to complementary nucleic add from a pathogenic strain of HTV-1; 

(ii) is substantially non-pathogenic in human subjects; and 

(iii) carries one or more deletion mutations in a region of its genome corresponding 
to a region which contains nef coding sequences and LTR nucleotide sequences. 

25 In a particularly preferred embodiment, the present invention provides non-pathogenic 
HTV-1 isolate C18 deposited at the ECACC on 17 October, 1994 under Provisional 
Accession Number V94101706. 

In a related embodiment, the present invention provides non-pathogenic HIV-1 isolate 
30 C98 deposited at the ECACC on 3 1 October, 1994 under Provisional Accession Number 
V941031169. 
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In another embodiment, the present invention provides non-pathogenic HIV-1 isolate 
C54 deposited at ECACC on 14 February, 1 995 under Provisional Accession No. . 

Although pathogenicity is a relative term, it is used herein in relation to the capacity of 
5 a strain of HIV-1 to induce AIDS or AIDS-related disorders in an individual over time. 
Accordingly, a "non-pathogenic" strain of HIV-1 is a strain which, at die clinical level, 
does not lead to the development of AIDS, at least within the median time of 6-10 years 
following infection with HIV-1, At the laboratory level, a non-pathogenic strain of 
HIV-1 is considered not to alter GD4+- cell counts or ^-microglobulin concentrations. 

10 In addition, a non-pathogenic strain of HIV-1 may not alter CD8+ and CD3+ cell counts 
and would not alter lymphocyte counts. CD4+;CD8+ ratios also remain unchanged 
relative to normal non-infected individuals. Furthermore, generally, a non-pathogenic 
strain of HIV-1 does not induce p24 antigenaemia. A non-pathogenic HIV-1 of the 
present invention is generally still infectious but individuals infected with the virus 

IS remain free of symptoms for at least 6-10 years after infection. 

A laboratoiy classified non-pathogenic strain of HIV-1 may be determined at any time 
after infection. The term "non-pathogenic" is not to be considered as a strain that is 
never pathogenic under any conditions as this might depend on the host individual, the 

20 level of immune responsiveness in that individual and the extent or otherwise of other, 
for example, immune comprising disorders. Accordingly, a "non-pathogenic" HIV-1 
isolate of the present invention may also be considered a "low virulent" strain of the 
virus. A non-pathogenic strain of HIV-1 as contemplated herein may be isolated from 
an asymptomatic individual or may be derived from a pathogenic strain by mutation. 

25 Although the present invention is not to be limited to any particular pathogenic strain 
of HIV-1, for reference purposes, an example of a pathogenic strain is HTV-1 NL4-3 
strain as described by Myers et al (1992). 

The non-pathogenic nature of the HIV-1 of the present invention is conveniently 
30 evidenced by the cohort of seven individuals comprising one donor and six recipients 
which have remained free of symptoms or signs of HIV-1 infection for greater than the 
median time of 6-10 years. However, the individuals of the cohort are seropositive for 
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HTV-l following infection with the virus as determined by Western blot analysis and 
genetic analysis (e.g. using PCR techniques). A seropositive individual is one showing 
reactivity to at least one HTV-1 glycoprotein (such as but not limited to gp 41-45, gp!20, 
gpI60) and at least three other virus-specific bands. 

5 

In accordance with the present invention, a non-pathogenic HTV-1 isolate is also a strain 
of HIV-1 which: 

(I) induces an immune response in a human or primate subject; and 
(ii) does not substantially reduce proliferative responses or cytokine production to a 
10 mitogen, aUoantigen and/or recall antigen relative to a healthy, non-infected 

subject 

Preferably, the immune response such as to a glycoprotein, for example gp41-45, gpl20 
and/or gpl60. Preferably, the cytokine monitored is an interleukin, such as EL-2. 
IS Preferably, the recall antigen is influenza or tetanus toxoid. A non-pathogenic HIV-1 
isolate is also one which: 

(in) does not substantially alter proliferative responses or cytokine production to 
allorgeneic mononuclear cells. 

20 The genomes or complementary DNA therefrom of the non-pathogenic HTV-1 isolates 
of the present invention are capable of hybridising under medium stringency conditions 
to the corresponding genome or complementary DNA of a pathogenic strain of HTV-1 
(e.g. HTV-1 strain NL4-3). The ability to hybridise to a pathogenic strain of HIV-1 only 
applies to a comparison of the entire genome/complementary DNA of a non-pathogenic 

25 strain or a fragment which includes genetic material corresponding to a region in the 
genome 3' of the nef gene in a pathogenic strain of HTV-1. 

For the purposes of reference only, a suitable genomic nucleotide sequence from a 
pathogenic HTV-1 strain is set forth in SEQ ID NO: 1 from HTV-1 strain NL4-3 (Myers 
30 era/, 1992): 

1 TGGAAGGGCTAATTTGGTCC C^AAAAAGACAAGAGAT CCTTGATCTGTGG 
51 ATCTACCACACACAACX3CTACTTCCCT 
101 GCCAGGGATCAGATATCCACTGACCTTTGGATGGTGCTTC^ 
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151 CAGTTGAACCAGAGCAAGTAGAAGAGGCCAAATAAGGA^ 
201 TTGTTACACCCTATGAGCCAGCATGGGATGK3AGGACCCGGAGGGAGAAGT 
251 ATTAGTGTGK3AAGTTTGACAGCCTCCT 
301 AGCTGCATCCGGAGTACTACAAAGACTGCTGACATCGA 
5 351 GGACTTTCCX5CTGGGGACTTTCCAGGGAGGTGTGGCCTO 

401 GGAGTGGCGAGCCXTCAGATGCTAC3VTATAAGCAGCTO 
451 ACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCTA 
501 ACTAGGGAACCCACroCTTAAGCCTCAATAAAGCT 
551 AAGTAGTGTGTGCCCGTCTGTTGTGTGACTCTGGTAACTAGAGATCCCTC 

10 601 AGACCCTTTTAGTCAGTGTGGAAAATCTCTAGCAGTG^ 

651 GACTTGAAAGCGAAAGTAAAGCCAGAGGAGATCTCT^ 
701 GCTTGCTGAAGCGCGCACGGC7AGAGGCGAGGGGCGK3CGACTGGTGAGTA 
751 CGCCAAAAATTTTGACTAGCGGAGGOTAGAAGGAGAGAGATGGG 
801 AGCGTCGGTATTAAGCGGGGGAGAATTAGATAAATGGGAAAAAATTCGGT 

15 651 TAAGGCCAGGGGGAAAGAAACAATATAAACTAAAACATATAGTATGGGCA 

901 AGCAGGGAGCTAGAACGATTCX^AGTTAATCCrGGCC 
951 AGAAGGCTGTAGACAAATACTCGGACAGCT^ 
1001 GATOUSAAGAACTTAGATCATTATATA^ 

1051 GTG(^TCAAAGGATAGATGTAAAAGAC^CCAAGGAAGCCrrTAGATAAGAT 
20 1101 AQAGGAAGAGCAAAACAAAAGTAAGAAAAAGGC^ 

1151 ACACAGGAAACAACAGCCAGGTCAGCCAA 

1201 CTCCAGGGGCAAATGGTACATCAGGCCATATCACCTAG^ 

1251 ATGKK3TAAAAGTAGTAGAAGAGAAGGCTTTCAGCCCA 

1301 TGTTTTCAGCATTATCAGAAGGAGCCACCCC^ 
25 1351 CTAAACACAGTGGGGGGACATCAAGCAGCCATGCAAATC 

1401 CATCAATGAGGAAGCTGCAGAATGGGATAGATTCCATCCAGTC 

1451 GGCCTATTGCACCIAGGCCAGATGAGAGAACCAAG 

1501 GGAACTACTAGTACCOTTCAGGAACA^TAGGATGGAT^ 

1551 ACCTATCCCAGTAGGAGAAATCTATAAAAGATGG^TAATCCTGGGATTAA 
30 1601 ATAAAATAGTAAGAATGTATAGCCCTACCZAGCATTCTGQACATAAGACAA 

1651 GGACCAAAGGAACCCTTTAGAGACTATGTAGACCGATTCTATAAAACTCT 

1701 AAGAGCCGAGCAAGCTTCACAAflAGGT^ 

1751 TGTTGGTCCAAAATGCGAACCCAGATTGTAA^ 

1801 GGACCAGGAGCX^CACTAGAAGAAATGATGACAG 
35 1851 GGGACCCGGCCATAAAGCAAGAGTTTTGGCTGAAGCAATG 

1901 CAAATCCAGCTACTCATAATGATACAGAAAGG 

1951 AAGACTGTTAAGTGTTTCAATTCTGGCAAAG 

2001 TTGGAGK^CCCCTAGGAAAAAGGGCTGTTGGAAATGTGGA 
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2051 AC CAAATGAAAGATTGTACTGAGAGACAGG CTAATTTTTT AGGGAAGAT C 
2101 TGGCCTT CCCACAAGGGAAGGCCAGGGAATTTTCTT CAGAGCAGACCAGA 
2151 GCCAACAGCCCCACCAGAAGAGAGCnTCAGGTnGGGGAAGAGACAACAA 
2201 CTCCCnrTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTTTAGCT 
5 2251 TCCCTCAGATCACTCTTTGGCAGCGACCCCTCGTCACAATAAAGATAGGG 

2301 GGGCAATTAAAGGAAGCTCTATTAGATACAGGAGCAGATGATACAGTATT 
2351 AGAAGAAATGAATTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGAA 
2401 TTGGAGGTTTTATCAAAGTAGGACAGTATGATCAGATACTCATAGAAATC 
2451 TGCGGACATAAAGiCTATAGGTAa\GTATTAGTAGQACCTACACCTGTCAA 

10 2501 CATAATTGGAAGAAATCTGTTGACTCAGATTGGCTGCACTTTAAATTTTC 

2551 CCATTAGT CCTATTGAGACTGTACCAGTAAAATTAAAGCCAGGAATGGAT 
2601 GGCCCAAAAGTTAAACIAATGGCCATTGACAGAAGAAAAAATAAAAGCATT 
2651 AGTAGAAATTTGTACAGAAATGGAAAAGGAAGGAAAAATTTCAAAAATTG 
2701 GGCCTGAAAATCCATACAATACTCXZAGTATTTGCC^TAAAGAAAAAAGAC 

IS 2751 AGTACTAAATGGAGAAAATTAGTAGATTTCAGAGAACTTAATAAGAGAAC 

2801 TC^GATTTCTGGGAAGTTCAATTAGGAATACCACATCCT 
2851 AACAGAAAAAATCAGTAACAGTACIX3GATGTGGGCGATGCATATTTTTCA 
2901 GTTCCCTTAGATAAAGACTTCAGGAAGTATACTGCATTTACCATACCTAG 
2951 TATAAACAATGAGACACCAGGGATTAGATAT CAGTACAATGTGCTTCCAC 

20 3001 AGGGATGGAAAGGAT CACCAGCAATATTCCAGTGTAGCATGACAAAAATC 

3051 TTAGAGCXTITTTAGAAAACAAAATCCAGACATAOT 

3101 GGATGATTTGTATGTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAA 
3151 AAATAGAGGAACTGAGACAACATCTGTTGAGGTGGGGATTTACCACAC CA 
3201 GACAAAAAACATCAGAAAGAACCTCCATTCCTTTGGATGGGTTATGAACT 

25 3251 CCATCCTGATAAATGGACAGTACAGCCTATAGTGCTGCCAGAAAAGGACA 

3301 GCTGGACTGTCAATGACATACAGAAATTAGTGGGAAAATTGAATTGGGCA 
3351 AGTCAGATTTATGCAGGGATTAAAGTAAGGCAATTATGTAAACTTCTTAG 
3401 GQGAACCAAAGCACTAACAGAAGTAGTAOZACTAACAGAAGAAGCAGAGC 
3451 TAGAACTGGCAGAAAACAGGGAGATTCTAAAAGAACCX3GTACATGGAGTG 

30 3501 TATTATGACCCATCAAAAGACI^AATAGCAGAAATACAGAAGCAGGGGCA 

3551 AGGC CAATGGACATATCAAATTTATCAAGAGCCATTTAAAAATCTGAAAA 
3601 CAGGAAAATATGCAAGAATGAAGGGTGCCCACACTAATGATGTGAAACAA 
3651 TTAACAGAGGCAGTACAAAAAATAGCCACAGAAAGCATAGTAATATGGGG 
3701 AAAGACTCCTAAATTTAAATTACCCATACAAAAGGAAACATGGGAAGCAT 

35 3751 GGTGGACAGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGTC 

3801 AATACCCCTCCCTTAGTGAAGTTATGGTACCAGTTAGAGAAAGAACCCAT 
3851 AATAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCCAATAGGGAAACTA 
3901 AATTAGGAAAAGCAGGATATGTAACK^CAGAGGAAGACAAAAAGTTGTC 
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3951 CCCCTAACX5GACACAACAAATCAGAA 
4001 AGCTTTGCAGGATTCGGGATTAGAM 

4051 ATGCATTGGGAATCATTCAAGCACAACCAGATA^AGTGAATCAGAGTTA 
4101 GTCAGTCAAATAATAGAGGAGTTAATAAAAAAGGAAAAAGTCTACCTGGC 
S 4151 ATGGGTACCAGCAGAGAAAGGAATTGGAGGAAATGAACAAGTAGATGGGT 

4201 TGGTCAGTGCTGGAATCAGGAAAGTACTATTTTTAGATGGAATAGATAAG 
4251 GCCCAAGAAGAACATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAG 
4301 TGATTTTAACCTACCACCTGTAGTAGCAAAAQAAATAGTAGC 
4351 ATAAATGTCAGCTAAAAGGGGAAGCCATGCATGGACAAGTAGACTGTAGC 

10 4401 CCAGGAATATGGCAGCTAGATTGTACACATTTAGAAGGAAAAGTTATCTT 

4451 GGTAGCAGTTCATGTAGCCAGTGGATATAT^ 
4501 pVGAGACAGGGCAAG^AACAGCATACTTCCTCTTAAAA 
4551 TGGCCAGTAAAAACAGTACATACAGACAATGG^ 
4601 TACAGTTAAGraCC^CCTGTTGGTGGGCGGGGATC^ 

15 4651 TTCCXITACAATCCCCAAAGTCAAG 

4701 TTAAAGAAAATTATAGGACAGGTAAGAGATCAGGCT^ 

4751 AGCAGTACAAATGGCAGTATTCATCCACAATTTTAAAAGAAA 

4601 TTGGGGGGTACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGAC 

4851 ATACAAACTAAAGAATTACAAAAAC^ 

20 4501 GGTTTATTACAGGGACAGCAGAGATCCM 

4951 TCCTCTGGAAAGGTGAAGGGGCAGTAGTAATACA^ 

5001 AAAGTAGTGCCAAGAAGAAAAGGAAAGATCATCAGGGATTATGG 

5051 GATGGCAGGTGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAAC^^ 

5101 TGGAAAAGATTAGTAAAACACCATATGTATATTTCAAGGAAAGCTAAGGA 

25 5151 CTGGTTTTATAGACATCACTATGAAAGTACTAATC 

5201 AAGTACACATCCCACTAGGGGATGCTAAATT^ 
5251 GGTCTGCATACAGGAGAAAGAGACTGGCATTTGGGTCAGGGAGT 
5301 AGAATGGAGGAAAAAGAGATATAGCAC^CAAGTAGArc 
5351 ACCAACTAATTCATCTGCACTATTTT^ 

30 5401 AGAAATACCATATTAGGACGTATAGTTAGTCCTAGGTGTGAATATCAAGC 

5451 AGGACATAACAAGGTAGGATCTCTACAGTAOTTGGCACTAGC^ 
5501 TAAAACCAAAACAGATAAAGCCACCTTTGCCTAGTGTTAGGAAACT^ 
5551 GAGGACAGATGGAACAAGCCCCAGAAGACCAAGGGCCZACAGAGGGAGCCA 
5601 TACAATGAATGGACACTAGAGCTTTTAGAGGAACTT 

35 5651 TAGACATTTTCCTAGGATATGGCTCCATAACTTAGGACAACATATCTATG 

5701 AAACTTACGGGGATACTTGGGCAGGAGTGGAAGCCATAATAAGAATT 
5751 CAACAACTGCTGTTTATCCATTTCAGAATTGGG^ 
5801 AGGCGTTACTCGACAGAGGAGAGCAAGAAATGGAGCC^TAGATCCTA^ 
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5851 CTAGAGCCCTGGAAGCATCCAGGAAQTCAGCCTAAAACTGCTTGTACCAA 
5901 TTGCTATTGTAAAAAGTGTTGCTTTCATTGCCAAGTTTGTTTCATGACAA 
5951 AAG CCTTAGGCAT CTCCTATGGCAGGAAGAAG CGGAGACAGCGACGAAGA 
6001 GCTCATCAGJACAGTCAGACTCATCAAGCTTCTCTATCAAAGCAGTAAGT 
5 6051 AGTACATGTAATGCAACCTATAATAGTAGCAATAGTAGCATTAGTAGTAG 

6101 CAATAATAATAGCAATAGTTGTGTGGTCCATAGTAATCATAGAATATAGG 
6151 AAAATATTAAGACAAAGAAAAATAGACAGGTTAATTGATAGACTAATAGA 
6201 AAGAGCAGAAGACAGTGGCAATGAGAGTGAAGGAGAAGTATCAG CACTTG 
6251 TGGAGATGGGGGTGGAAATGGGGCACCATGCTCCTTGGGATATTGATGAT 
10 6301 CTGTAGTGCTACAGAAAAATTGTGGGTCACAGTCTATTATGGGGTACCTG 

6351 TGTGGAAGGAAGCAACCACC^CrCTATTTTGTGCATCAGATGCTAAAGCA 
6401 TATGATACAGAGGTACATAATGTTTGGGCCACACATGCCTGTGTACCCAC 
6451 AGACCCCAACCCACAAGAAGTAGTATTGGTAAATGTGACAGAAAATTTTA 
6501 ACATGTGGAAAAATGACATGGTAGAACAGATGCATGAGGATATAATCAGT 

IS 6551 TTATGGGATCAAAGCCTAAAGCCATGTGTAAAATTAACCCCACT CTGTGT 
6601 TAGTTTAAAGTCCACTGATTTGAAGAATGATACTAATACCAATAGTAGTA 
6651 GCGGGAGAATGATAATGGAGAAAGGAGAGATAAAAAACTGCTCTTTCAAT 
6701 ATCAGCACAAGCATAAGAGATAAGGTGCAGAAAGAATATGCATTCTTTTA 
6751 TAAACTTGATATAGTACCAATAGATAATACCAGCTATAGGTTGATAAGTT 

20 6801 GTAACACCTCAGTCATTACACAGGCCTGTCCAAAGGTATCCrTTGAGCCA 

6851 ATTCCCATAGATTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAA 
6901 TAATAAGACGTTCAATGGAACAGGACC^TGTACAAATGTCAGCACAGTAC 
6951 AATGTACACATGGAATCAGGCCAGTAGTATGAACTCAACTGCTGTTAAAT 
7001 GGCAGTCTAGCAGAAGAAGATGTAGTAATTAGATCTGCCAATTTCACAGA 

25 7051 CAATGCTAAAACCATAATAGTACAGCTGJUICACAT 

7101 GTACAAGACCCAACAACAATACAAGAAAAAGTATCCX3TATCCAGAGGGGA 
7151 CCAGGGAGAGCATTTGTTACAATAGGAAAAATAGGAAATATGAGACAAGC 
7201 ACATTGTAACATTAGTAGAGCAAAATGGAATGCCACTTTAAAACAGATAG 
7251 CTAGCAAATTAAGAGAACAATTTGGAAATAATAAAACAATAATCITTAAG 

30 7301 CAATCCTCAGGAGGGGACCCAGAAATTGTAACGCACAGTnTAATTGTGG 

7351 AGGGGAATTTTTCTACTGTAATT CAACACAACTGTTI7ATAGTACTTGGT 
7401 TTAATAGTACTTGGAGTACTGAAGGGT CAAATAACACTGAAGGAAGTGAC 
7451 ACAATCACACTCCCATGCAGAATAAAACAATTTATAAACATGTGGCAGGA 
7501 AGTAGGAAAAGCAATGTATGC^CTCCCATCAGTGGACAAATTAGATGTT 

35 7551 CATCAAATATTACTGGGCTGCTATTAACAAGAGATGGTGGTAATAACAAC 

7601 AATGGGTCCGAGATCTTCAGACCTGGAGGAGGCGATATGAGGGACAATTG 
7651 GAGAAGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAG 
7701 TAGCArcCaCCAAGGCAAAGAGAAGAGTGGTGCAGAGAGAAAAAAGAGCA 
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7751 GTGGGAATAGGAGCTTTGTTCCTTGGGTTCT 
7801 TATGGGCTGCACGTCAATGACGCTGACGGTACAGTC 
7851 CTGATATAGTGCAGCAGCAGAACAATT^ 
7901 CAGCATCTGTTGCAACTCACAGTCTGG 
5 7951 AATCCTGGCTGTGGAAAGATACCTAAAGGATCAACAG CTCCTGGGGATTT 

8001 GGGGTTGCTCTGGAAAACTCATTTGCACCACTGCTGTC 
8051 AGTTGGAGTAATAAATCTCTGGAAGAGATTTGGAAT 
8101 GGAGTGGGACAGAGAAATTAACAATTACACAAGCT^ 
8151 TTGAAGAATCGCAAAACCAGCAAGAAAAGAAT^ 
10 8201 TTAGATAAATGGGCAAGTTTGTGGAATTTC 

8251 GTGGTATATAAAATTATTCATAATGATACT 
8301 GAATAGTTTTTGCTGTACTTTCTATAGTG^ 
8351 TCACCATTATCGTTTCAGA^^ 

8401 GCCCGAAGGAATAGAAGAAGAAGGTGGAGAGAGAGACAGAGACAGATC^ 
15 8451 TTCGATTAGTGAACGGATCCTTAGGACTTA 

8501 CTGTGCCTCTTCAGCTACCACCGCTTGAGAGACra 

8551 GAGGATTGTGGAACTTCTGGGACGCAGGGGGTGGGAAGCCCTCAAATA 

8601 GGTGGAATCTCCTACAGTATTGGAGTCAGGAACTAAAG^ 

8651 AACTTGCTCAATGCCACAGCCATAGCAGTAGCT 
20 8701 TATAGAAGTATTACAAGCAGCTTATAGAGCTATTCGCCACATACCTAG^ 

8751 GAATAAGACAGGGCTTGGAAAGGATTTTGCTATA^ 

8801 GTCAAAAAGTAGTGTGATTGGATGGCCTGCTO^ 

8851 GAGCTGAGCCAGCAGCAGATGGGGTGGGAGCA^ 

8901 AAACATGGAGCAATCACAAGTAGCAM 
25 8951 TGCCTGGCTAGAAGCACAAGAGGAGGAAGAGGTGGGTTTT 

9001 CTGAGGTACCTTTAAGACGAATGACTTACAAG 

9051 CACTTTTTAAAAGAAAAGGGGGGACTGGAAGGG 

9101 AAGACAAGATATCCTTGATCTGTGGATCT 

9151 CTGATTGGCAGAACTACACACCA^ 
30 9201 TTTGGATGGTGCTACAAGCTAGTAC^ 

9251 GGCCAATAAAGGAGAGAACACCZAGCTTGTTACACCCT 

9301 GAATGGATGACCCTGAGAGAGAAGTGTTW 

9351 CTAGCMTTCATCACGTGGCC^ 

9401 CTGCTGACaTCXaAGCTTGCTACAAGGG^CTTTCCGCT^ 
35 9451 GGAGGCGTGGCCTGGGCXSGGACTGaOGAGTGGCGAGCCCT 

9501 ATATAAGCAGCTG L U 1 TTT TGCCTGTACTGGGTCTCTCTG 
9551 TCTGAGCCTGGGAGCrCTCTGGCTAACT 
9601 CAATAAAGCTTGCCTTGAGTGCrrCAM 
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9651 TQACTCTGGTAACTAQAGATCCCTCAGACCCTTTT 
9701 TCTCTAGCA 

However, for the purposes of comparing the nucleotide sequences of non-pathogenic 
S HTV-1 strains including the ability to hybridise to a reference strain, the present 
invention extends to a genomic nucleotide sequence from any pathogenic strain of HTV- 
1. 

Accordingly, in a particularly preferred embodiment, there is provided a viral isolate 
10 which: 

(i) carries a genome which is capable of hybridising under medium stringency 
conditions to SEQ ID NO: 1 or a complementary form thereof or an analogous 
sequence from another pathogenic strain of HTV-1; and 

(ii) carries a deletion mutation in a region corresponding to die nef gene and/of in 
15 an LTR region. Generally, such an HTV-1 isolate is non-pathogenic as 

hereinbefore defined 

In a related embodiment, there is provided an isolated virus which: 
(0 has a genome which is capable of hybridising under medium stringency 
20 conditions to complementary nucleic acid from a pathogenic strain of HTV-1; 

and 

(ii) carries one or more deletion mutations in a region of its genome corresponding 
to a region which contains nt/coding sequences and LTR nucleotide sequences. 

25 For the purposes of defining die level of stringency, reference can conveniently be made 
to Maniatis et al (1982) at pages 387-389 which is herein incorporated by reference 
where the washing steps disclosed are considered high stringency. A low stringency is 
defined herein as being in 1-3X SSC/0.1-0.5% w/v SDS at 37-50°C for 2-3 hours. 
Depending on the source and concentration of nucleic acid involved in the hybridisation, 

30 alternative conditions of stringency may be employed such as medium stringent 
conditions which are considered herein to be 0.1 -IX SSC/0.25-0.5% w/v SDS at £ 45°C 
for 2-3 hours or high stringent conditions considered herein to be 0. MX SSC/0.1% w/v 
SDS at 60°C for 1-3 hours. 
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In a particularly preferred embodiment of the present invention, the non-pathogenic 
strain of HTV-1 carries a mutation in the nef gene and/or LTR region of the genome. 

A "mutation" is considered herein to include a single or multiple nucleotide substitution, 
5 deletion and/or addition. Most preferred mutations are single or multiple deletions of 
at least one, most preferably at least ten and even more preferably at least twenty 
contiguous nucleotides from a region corresponding to the nef gene and/or the LTR 
region. When the non-pathogenic virus carries a mutation in the LTR region, this 
generally occurs 5' of the Spl sites. 

10 

According to a preferred aspect of the present invention, there is provided a viral isolate 
winch: 

(i) is reactive to antibodies to a glycoprotein from HTV-1 such as at least one of 
gp41-45, gpl20 and/or gpl60; 

15 (ii) is substantially non-pathogenic in human subjects; and 

(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding 
to the nef gene and/or LTR region of a pathogenic strain of HTV-1. 

In another embodiment, there is provided a viral isolate which: 
20 (i) is capable of inducing an immune response to at least one of gag pel and/or emr 9 

(ii) is substantially non-pathogenic in human subjects; and 

(iii) carries a deletion mutation of at least ten nucleotides in a region corresponding 
to the nef gene and/or LTR region of a pathogenic strain of HTV-1. 

25 Preferably, in respect of the latter embodiment, the immune response is an antibody or 
a cell mediated response. In a most preferred embodiment, the immune response is a 
humoral immune response. 

The nucleotide sequence of the m/gene in HIV-1 NL4-3 is defined in SEQ ID NO: 
30 650: 

ATGGGTGGCAAGTGGTOtf^ 
GAGAQGAGCTGAGCCAGC^ 

ATGGAGCAATC^CAAGTAGCAATACAGCAGCTAAC^ 
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GCACAAGAGGAGGAAGAGGTGGGTTTTCCAGTCACACCT 
GACTTACAAGGCAGCTGTAGATCTTAGCCAC^^ 
GGCTAATTCACTCCCAAAGAAGACAAGATATCCTTGATCTGT 
GGCTACTTCCCTGATTGGCAGAACTACACACCAGGGCCAGGGG 
5 CTTTGGATGGTGCTACAAGCTAGT^^ 
AAGGAGAGAACACCAGCTTGTTACAC^ 
AGAGAAGTGTTAGAGTGGAGGTTTGAC^^ 
GCTGCATCCGGAGTACTTCAAGAACTGCTGA 

10 The present invention extends to any or all single or multiple nucleotide deletions to a 
contiguous series of at least ten nucleotides from the ne/gene which render the strain 
avirulent The deletions may encompass the entire gene or parts thereof and may 
represent a single deletion or two or more deletions. Put in alternative terms, the non- 
pathogenic HIV-1 isolates of the present invention comprise a nucleotide sequence at the 

IS corresponding nef gene region non-identifiable to SEQ ID NO: 650, said non-identity 
comprising at least 5%, more preferably at least 10% and even more preferably at least 
20% variation thereon. 

In a preferred embodiment, therefore, the present invention contemplates a viral isolate 
20 which: 

(i) is reactive to antibodies to a glycoprotein from HIV-1 such as at least one of 
gp4M5, gp!20 and/or gpl60; 

(ii) carries a genome or a part or fragment thereof capable of hybridising under 
medium stringency conditions to a nucleotide sequence as set forth in SEQ ID 

25 NO: 1 or a complementary form thereof or an analogous sequence from another 

pathogenic strain of HTV-1; 
(iu) carries a deletion of at lea& ten nucleotides in a region corresponding to the nef 

gene in HTV-1 NL4-3; and 
wherein said deletion encompasses one or more of the following decanucleotides from 
30 thene/geae of HIV-1 NL4-3 or rorre^ponding sequences from anotiio-pathog«iic strain 
of HIV-1: 



ATGGGTGGCA (SEQ ID NO: 2) 
GGGTGGCAAG (SEQ ID NO: 4} 
GTGGCAAGTG (SEQ ID NO: 6) 



TGGGTGGCAA { SEQ ID NO: 3) 
GGTGGCAAGT (SEQ ID NO: 5) 
TGGCAAGTGG (SEQ ID NO: 7) 
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GGCAAGTGGT (SEQ ID NO: 8); 
CAAGTGQTCA ( SEQ ID NO: 10) ; 

AGTGGTCAAA ( SEQ ID NO: 12) ; 

TGGTCAAAAA ( SEQ ID NO: 14) ; 
5 QTCAAAAAGT (SEQ ID NO: 16); 

CAAAAAGTAG ( SEQ ID NO: 18); 

AAAA6TAGTG ( SEQ ID NO: 20); 

AAGTA6TGTG ( SEQ ID NO: 22); 

6TAGT6T6AT ( SEQ ID NO: 24); 
10 AGTGT0ATT6 (SEQ ID NO: 26); 

TGTGATTG6A ( SEQ ID NO: 28); 

TGATTGQATG (SEQ ID NO: 30); 

ATTGGATGGC ( SEQ ID NO: 32); 

TGGATGGCCT ( SEQ ID NO: 34); 
15 GATGGCCTGC ( SEQ ID NO: 36); 

TGGCCTGCTG { SEQ ID NO: 38); 

6CCT6CT6TA(SEQ ID NO: 40) ; 

CTGCTGTAAG ( SEQ ID NO: 42); 

GCTGTAAGGG ( SEQ ID NO: 44); 
20 TGTAAGGGAA ( SEQ ID NO: 46); 

TAAGGGAAAG ( SEQ ID NO: 48); 

AGGGAAA6AA ( SEQ ID NO: SO); 

GGAAAGAATG ( SEQ ID NO: 52); 

AAAGAATGAG (SEQ ID NO: 54}; 
25 AGAATGAGAC ( SEQ ID NO: 56); 

AATGAGACGA ( SEQ ID NO: 58) ; 

TGAGACGAGC ( SEQ ID NO: 60) ; 

AGACGAGCTG (SEQ ID NO: 62); 

ACGAGCTGAG ( SEQ ID NO: 64); 
30 GAOCTGAGCC(SEQ ID NO: 66); 

6CTGAGCCAG(SEQ ID NO: 68); 

TGAGCCAGCA ( SEQ ID NO: 70); 

AGCCAGCAGC (SEQ ID NO: 72); 

CCAGCAGCAG(SEQ ID NO: 74); 
35 AGCAGCAGAT ( SEQ ID NO: 76); 

CA6CAOATGO(SEQ ID NO: 78); 

GCAGATGGGG ( SEQ ID NO: 80); 

AGATGGGGTG ( SEQ ID NO: 82); 
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GCAAGTGGTC (SEQ 


ID 


NO: 


9); 


AAGTGGTCAA (SEQ 


ID 


NO: 


11) 


GTGGTCAAAA ( SEQ 


ID 


NO: 


13) 


GGTCAAAAAG (SEQ 


ID 


NO: 


15) 


TCAAAAAGTA (SEQ 


ID 


NO: 


17) , 


AAAAAGTAGT (SEQ 


ID 


NO: 


19) i 


AAAGTAGTGT (SEQ 


ID 


NO: 


21) i 


AGTAGTGTGA (SEQ 


ID 


NO: 


23) ; 


TAGTGTGATT (SEQ 


ID 


NO: 


25) ; 


GTGTGATTGG (SEQ 


ID 


NO: 


27) ; 


GTGATTGGAT (SEQ 


ID 


NO: 


29) ; 


GATTGGATGG (SEQ 


ID 


NO: 


31); 


TTGGATGGCC (SEQ 


ID 


NO: 


33); 


GGATGGCCTO (SEQ 


ID 


NO: 


35); 


ATGGCCTGCT (SEQ 


ID 


NO: 


37) ; 


GGCCTGCTGT (SEQ 


ID 


NO: 


39) ; 


CCTGCTGTAA (SEQ 


ID 


NO: 


41); 


TGCTGTAAGG (SEQ 


ID 


NO: 


43); 


CTGTAAGGGA ( SEQ 


ID 


NO: 


4.5); 


GTAAGGGAAA ( SEQ 


ID 


NO: 


47) ; 


AAGGGAAAGA (SEQ 


ID 


NO: 


49); 


GGGAAAGAAT (SEQ 


ID 


NO: 


51) ; 


GAAAGAATGA ( SEQ 


ID 


NO: 


53) ; 


AAGAATGAGA (SEQ 


ID 


NO: 


55) ; 


GAATGAGACG (SEQ 


ID 


NO: 


57) ; 


ATGAGACGAG (SEQ 


ID 


NO: 


59) ; 


GAGACGAGCT (SEQ 


ID 


NO: 


61); 


GACGAGCTGA (SEQ 


ID 


NO: 


63); 


CGAGCTGAGC (SEQ 


ID 


NO: 


65); 


AGCTGAGCCA (SEQ 


ID 


NO: 


67) ; 


CTGAGCCAGC (SEQ 


ID- 


NO: 


69) ; 


GAGCCAGCAG (SEQ 


ID 


NO: 


71); 


GCCAGCAGCA (SEQ 


ID 


NO: 


73) ; 


CAGCAGCAGA ( SEQ 


ID 


NO: 


75) ; 


GCAGCAGATG (SEQ 


ID 


NO: 


77) ; 


AGCAGATGGG (SEQ 


ID 


NO: 


79) ; 


CAGATGGGGT ( SEQ 


ID 


NO: 


81); 


GATGGGGTGO (SEQ 


ID 


NO: 


83) ; 
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ATGGGGTGGG (SEQ ID NO: 84); 

GGGGTGGGAG (SEQ . ID NO : 86); 

GGTGGGAGCA ( SEQ ID NO: 88); 

TGGGAGCAGT (SEQ ID NO: 90); 
S GGAGCAGTAT (SEQ ID NO: 92); 

AGCAGTATCT (SEQ ID NO: 94); 

CAGTATCTC6 (SEQ ID NO: 96); 

GTATCTCGAG (SEQ ID NO: 98); 
ATCTCGAGAC (SEQ ID NO: 100); 
10 CTCGAGACCT (SEQ ID NO: 102); 
CGAGACCTAG (SEQ ID NO: 104); 
AGACCTAGAA ( SEQ ID NO: 106); 
ACCTAGAAAA ( SEQ ID NO: 108) ; 
CTAGAAAAAC (SEQ ID NO: 110); 
15 AGAAAAACAT (SEQ ID NO: 112); 
AAAAACATGG (SEQ ID NO: 114) ; 
AAACATGGAG ( SEQ ID NO: 116); 
ACATGGAGCA ( SEQ ID NO: 118); 
ATGGAGCAAT ( SEQ ID NO: 120); 
20 GGAGCAATCA ( SEQ ID NO: 122) ; 
AGCAATCACA ( SEQ ID NO: 124) ; 
CAATCACAAG ( SEQ ID NO: 126) ; 
ATCACAAGTA ( SEQ ID NO: 128) ; 
CACAAGTAGC ( SEQ ID NO: 130) ; 
25 CAAGTAGCAA (SEQ ID NO: 132} ; 
AGTAGCAATA (SEQ ID NO: 134) ; 
TAGCAATACA (SEQ ID NO: 136); 
GCAATACAGC (SEQ ID NO: 138); 
AATACAGCAG ( SEQ ID NO: 140); 
30 TACAGCAGCT ( SEQ ID NO: 142); 
CAGCAGCTAA(SEQ ID NO: 144) ; 
GCAGCTAACA ( SEQ ID NO: 146); 
AGCTAACAAT (SEQ ID NO: 148); 
CTAACAATGC ( SEQ ID NO: 150); 
35 AACAATGCTG ( SEQ ID NO: 152); 
CAATGCTGCT (SEQ ID NO: 154); 
ATGCTGCTTG (SEQ ID NO: 156) ; 
GCTOCTTGTG (SEQ ID NO: 158); 
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TGGGGTGGGA (SEQ 


ID 


NO: 


85) ; 


GGGTGGGAGC (SEQ 


ID 


NO: 


87) ; 


GTGGGAGCAG (SEQ 


ID 


NO: 


89) ; 


GGGAGCAGTA ( SEQ 


ID 


NO: 


91) ; 


QAGCAGTATC (SEQ 


ID 


NO: 


93) ; 


GCAGTATCTC (SEQ 


ID 


NO: 


95) ; 


AGTATCTCGA ( SEQ 


ID 


NO: 


97) ; 


TATCTCGAGA ( SEQ 


ID 


NO: 


99) ; 


TCTCGAGACC (SEQ 


ID 


NO: 


101) ; 


TCGAGACCTA (SEQ 


ID 


NO: 


103) ; 


GAGACCTAGA ( SEQ 


ID 


NO: 


105) ; 


GACCTAGAAA (SEQ 


ID 


NO: 


107) ; 


CCTAGAAAAA ( SEQ 


ID 


NO: 


109) ; 


TAGAAAAACA (SEQ 


ID 


NO: 


HI) ; 


GAAAAACATG ( SEQ 


ID 


NO: 


113) ; 


AAAACATGGA ( SEQ 


ID 


NO: 


115) ; 


AACATGGAGC (SEQ 


ID 


NO: 


117) ; 


CATGGAGGAA (SEQ 


ID 


NO: 


119) ; 


TGGAGCAATC (SEQ 


ID 


NO: 


121} ; 


GAGCAATCAC (SEQ 


ID 


NO: 


123) ; 


GCAATCACAA ( SEQ 


ID 


NO: 


125} ; 


AATCACAAGT ( SEQ 


ID 


NO: 


127) ; 


TCACAAGTAG (SEQ 


ID 


NO: 


129) ; 


ACAAGTAGCA (SEQ 


ID 


NO: 


131) ; 


AAGTAGCAAT ( SEQ 


ID 


NO: 


133) ; 


GTAGCAATAC (SEQ 


ID 


NO: 


135) ; 


AGCAATACAG (SEQ 


ID 


NO: 


137} ; 


CAATACAGCA (SEQ 


ID 


NO: 


139) ; 


ATACAGCAGC (SEQ 


ID 


NO: 


141) ; 


ACAGCAGCTA (SEQ 


ID 


NO: 


143} ; 


AGCAGCTAAC (SEQ 


ID. 


NO: 


145} ; 


CAGCTAACAA ( SEQ 


ID 


NO: 


147) ; 


GCTAACAATG ( SEQ 


ID 


NO: 


149) ; 


TAACAATGCT ( SEQ 


ID 


NO: 


151) ; 


ACAATGCTGC ( SEQ 


ID 


NO: 


153} ; 


AATGCTGCTT (SEQ 


ID 


NO: 


155); 


TGCTGCTTGT (SEQ 


ID 


NO: 


157) ; 


CTGCTTGTGC (SEQ 


ID 


NO: 


159) ; 
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TGCTTGTGCC (SEQ ID NO: 160 
CTTGTGCCTG (SEQ ID NO: 162 
TGTGCCT6GC ( SEQ ID NO: 164 
TGCCTGGCTA ( SEQ ID NO: 166 
5 CCTGGCTAGA(SEQ ID NO: 168 
TGGCTAGAAG (SEQ ID NO: 170 
6CTAGAA6CA (SEQ ID NO: 172 
TAGAAGCACA (SEQ ID NO: 174! 
GAAGCACAAG (SEQ ID NO: 176 

10 AGCACAAGAG (SEQ ID NO: 178: 
CACAAGAGGA(SEQ ID NO: 180! 
CAAGAGGAGG (SEQ ID NO: 182! 
AGAGGAGGAA ( SEQ ID NO: 184] 
AGGAGGAAGA (SEQ ID NO: 186] 

IS GAGGAAGAGG ( SEQ ID NO: 188] 
GGAAGAGGTG(SEQ ID NO: 190] 
AAGAGGTGGG (SEQ ID NO: 192] 
GAGGTGGGTT (SEQ ID NO: 194] 
GGTGGGTTTT (SEQ ID NO: 196] 

20 TGGGTTTTCC (SEQ ID NO: 198] 
GGTTTTCCAG (SEQ ID NO: 200] 
TTTTCCAGTC (SEQ ID NO: 202] 
TTCCAGTCAC (SEQ ID NO: 204] 
CCAGTCACAC(SEQ ID NO: 206] 

25 AGTCACACCTtSEQ ID NO: 208] 
TCACACCTCA(SEQ ID NO: 210] 
ACACCTCAGG ( SEQ ID NO: 212] 
ACCTCAGGTA (SEQ ID NO: 214] 
CTCAGGTACC (SEQ ID NO: 216] 

30 CAGGTACCTT (SEQ ID NO: 218] 
GGTACCTTTA (SEQ ID NO: 220] 
TACCTTTAAG (SEQ ID NO: 222] 
CCTTTAAGAC (SEQ ID NO: 224] 
TTTAAGACCA (SEQ ID NO: 226] 

35 TAAGACCAAT (SEQ ID NO: 228] 
AGACCAATGA ( SEQ ID NO: 230] 
ACCAATGACT ( SEQ ID NO: 232] 
CAATGACTTA ( SEQ ID NO: 234] 
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• 


GCTTGTGCCT (SEQ 


ID 


NO: 


161) j 


7 


TTGTGCCTGG (SEQ 


ID 


NO: 


163) ; 


9 


GTGCCTGGCT (SEQ 


ID 


NO: 


165) ; 


S 


GCCTGGCTAG (SEQ 


ID 


NO: 


167) ; 


t 


CTGGCTAGAA (SEQ 


ID 


NO: 


169) ; 




GGCTAGAAGC (SEQ 


ID 


NO: 


171) ; 


; 


CTAGAAGCAC (SEQ 


ID 


NO: 


173) ; 


* 


AGAAGCACAA CSEQ 


ID 


NO : 


» mm m~ \ 

175) ; 




AAGCACAAGA ( SEQ 


ID 


NO: 


177) ; 


§ 


GCACAAGAGG ( SEQ 


ID 


NO: 


179) ; 




ACAAGAGGAG { SEQ 


ID 


NO: 


181} ; 


m 
f 


AAGAGGAGGA (SEQ 


ID 


NO: 


183) ; 


a 

» 


GAGGAGGAAG (SEQ 


ID 


NO: 


185) ; 


; 


GGAGGAAGAG (SEQ 


ID 


NO: 


187) ; 




AGGAAGAGGT (SEQ 


ID 


NO: 


189) / 




GAAGAGGTGG ( SEQ 


ID 


NO: 


191) ; 




AGAGGTGGGT(SEQ 


ID 


NO: 


193); 




AGGTGGGTTT (SEQ 


ID 


NO: 


195) ; 




GTGGGTTTTC (SEQ 


ID 


NO: 


197) ; 




GGGTTTTCCA (SEQ 


ID 


NO: 


199) ; 




GTTTTCCAGT (SEQ 


ID 


NO: 


201) ; 




TTTCCAGTCA (SEQ 


ID 


NO: 


203) ; 




TCCAGTGACA ( SEQ 


ID 


NO: 


205) ; 




CAGTCACACC(SEQ 


ID NO: 


207) ; 


; 


GTCACACCTC(SEQ 


ID 


NO: 


209) ; 


9 


CACACCTCAG(SEQ 


ID 


NO: 


211) ; 


; 


CACCTCAGGT (SEQ 


ID 


NO: 


213), 


; 


CCTCAGGTAC (SEQ 


ID 


NO: 


215); 




TCAGGTACCT (SEQ 


ID 


NO: 


217) ; 




AGGTACCTTT (SEQ 


ID 


NO: 


219} ; 




GTACCTTTAA (SEQ 


ID 


NO: 


221) ; 




ACCTTTAAGA ( SEQ 


ID 


NO: 


223) ; 




CTTTAAGACC (SEQ 


ID NO: 


225) ; 




TTAAGACCAA ( SEQ 


ID 


NO: 


227) ; 




AAGACCAATG ( SEQ 


ID 


NO: 


229) ; 




GACCAATGAC (SEQ 


ID 


NO: 


231) ; 




CCAATGACTT (SEQ 


ID 


NO: 


233) ; 




AATGACTTAC (SEQ 


ID 


NO: 


235) ; 
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ATGACTTACA ( SEQ ID NO: 
GACTTACAAO ( SEQ ID NO: 
CTTACAAGGC (SEQ ID NO: 
TACAAGGCAG (SEQ ID NO: 
5 CAAGGCAGCT (SEQ ID NO: 
AG6CAGCTGT ( SEQ ID NO: 
GCAGCTGTAG(SEQ ID NO: 
A6CT6TA6AT (SEQ ID NO: 
CTGTAGATCT (SEQ ID NO: 
10 GTAGATCTTA (SEQ ID NO: 
AGATCTTAGC (SEQ ID NO: 
ATCTTAGCCA ( SEQ ID NO: 
CTTAGCCACT (SEQ ID NO: 
TAGCCACTTT (SEQ ID NO : 
15 GCCACTTTTT (SEQ ID NO: 
CACTTTTTAA { SEQ ID NO: 
CTTTTTAAAA ( SEQ ID NO: 
TTTTAAAAGA (SEQ ID NO: 
TTAAAAGAAA (SEQ ID NO: 
20 AAAAGAAAAG (SEQ ID NO: 
AAGAAAAGGG (SEQ ID NO: 
GAAAAGGGGG (SEQ ID NO: 
AAAGGGGGGA (SEQ ID NO: 
AGGGGGGACT (SEQ ID NO: 
25 GGGGGACTGG (SEQ ID NO: 
GGGACTGGAA { SEQ ID NO: 
GACTGGAAGG (SEQ ID NO: 
CTGGAAGGGC (SEQ ID NO: 
GGAAGGGCTA (SEQ ID NO: 
30 AAGGGCTAAT (SEQ ID NO: 
GGGCTAATTC (SEQ ID NO: 
GCTAATTCAC (SEQ ID NO: 
TAATTCACTC (SEQ ID NO: 
ATTCACTCCC (SEQ ID NO: 
35 TCACTCCCAA(SEQ ID NO: 
ACTCCCAAAG (SEQ ID NO: 
TCCCAAAGAAfSEQ ID NO: 
CCAAAGAAGA (SEQ ID NO: 
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236} ; TGACTTACAA (SEQ ID NO: 237} , 

239) ; ACTTACAAGG (SEQ ID NO: 239} ; 

240) ; TTACAAGGCA ( SEQ ID NO: 241); 
242); ACAAGGCAGC (SEQ ID NO: 243); 
244); AAGGCAGCTG(SEQ ID NO: 245); 
246) ; GGCAGCTGTA ( SEQ ID NO: 247) ; 
248) ; CAGCTGTAGA(SEQ ID NO: 249} ; 
250); GCTGTAGATC (SEQ ID NO: 251); 
252); TGTAGATCTT (SEQ ID NO: 253); 
254) ; TAGATCTTAG (SSQ ID NO: 255} ; 
256); GATCTTAGCC (SEQ ID NO: 257); 
258); TCTTAGCCAC (SEQ ID NO: 259); 
260); TTAGCCACTT (SEQ ID NO: 261); 
262) ; AGCCACTTTT(SEQ ID NO: 263) ; 
264) ; CCACTTTTTA ( SEQ ID NO: 265) ; 
266) ; ACTTTTTAAA (SEQ ID NO: 267) ; 
268); TTTTTAAAAG (SEQ ID NO: 269); 
270); TTTAAAAGAA (SEQ ID NO: 271); 
272); TAAAAGAAAA (SEQ ID NO: 273); 
274); AAAGAAAAGG (SEQ ID NO: 275); 
276) ; AGAAAAGGGG (SEQ ID NO: 277); 
278); AAAAGGGGGG (SEQ ID NO: 279); 
280); AAGGGGGGAC (SEQ ID NO: 281); 
282); GGGGGGACTG(SEQ ID NO: 283); 
284} ; GGGGACTGGA ( SEQ ID NO: 285} ; 
286); GGACTGGAAG (SEQ ID NO: 287); 
288); ACTGGAAGGG (SEQ ID NO: 289); 
290); TGGAAGGGCT (SEQ ID NO: 291); 
292); GAAGGGCTAA(SEQ ID NO: 293); 
294); AGGGCTAATT (SEQ ID NO: 295); 
296); GGCTAATTCA (SEQ ID NO: 297); 
298); CTAATTCACT (SEQ ID NO: 299); 
300) ; AATTCACTCC ( SEQ ID NO: 301) ; 
302); TTCACTCCCA { SEQ ID NO: 303); 
304) ; CACTCCCAAA(SEQ ID NO: 305) ; 
306) ; CTCCCAAAGA ( SEQ ID NO: 307) ; 
308); CCCAAAGAAG (SEQ ID NO: 309); 
310); CAAAGAAGAC (SEQ ID NO: 311); 
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AAAGAAGACA (SEQ ID NO: 312); 
AGAAGACAAG (SEQ ID NO: 314); 

AAGACAAGAT (SEQ ID NO: 316); 
GACAAGATAT (SEQ ID NO: 318); 
5 CAAQATATCC (SEQ ID NO: 320}; 

AGATATCCTT (SEQ ID NO: 322); 

ATATCCTTGA (SEQ ID NO: 324) ; 

ATCCTTGATC (SEQ ID NO: 326); 

CCTTGATCTG (SEQ ID NO: 328); 
10 TTGATCTGTO (SEQ ID NO: 330) 

GATCTGTGGA ( SEQ ID NO: 332) 

TCTGTGGATC (SEQ ID NO: 334) 

TGTGGATCTA (SEQ ID NO: 336) 

TG6ATCTACC (SEQ ID NO: 338) 
IS GATCTACCAC (SEQ ID NO: 340) 

TCTACCACAC (SEQ ID NO: 342) 

TACCACACAC (SEQ ID NO: 344) 

CCACACACAA(SEQ ID NO: 346) 

ACACACAAGG ( SEQ ID NO: 348) 
20 ACACAAGGCT (SEQ ID NO: 350) 

ACAAGGCTAC (SEQ ID NO: 352); 

AAGGCTACTT (SEQ ID NO: 354); 

GGCTACTTCC (SEQ ID NO: 356); 

CTACTTCCCT (SEQ ID NO: 358); 
25 ACTTCCCTGA ( SEQ ID NO: 360) ; 

TTCCCTGATT (SEQ ID NO: 362); 

CCCTGATTGG (SEQ ID NO: 364); 

CTGATTGGCA (SEQ ID NO: 366); 

GATTGGGAGA (SEQ ID NO: 368); 
30 TTGGCAGAAC (SEQ ID NO: 370); 

GGCAGAACTA (SEQ ID NO: 372); 

CAGAACTACA ( SEQ ID NO: 374); 

GAACTACACA (SEQ ID NO: 376); 

ACTACACACC (SEQ ID NO: 378); 
35 TACACACCAG (SEQ ID NO: 380); 

CACACCAGGG (SEQ ID NO: 382); 

CACCAGGGCC (SEQ ID NO: 384); 

CCAGGGCCA6(SEQ ID NO: 386); 



AAGAAGACAA (SEQ ID NO: 313} 
GAAGACAAGA ( SEQ ID NO: 315) 
AGACAAGATA (SEQ ID NO: 317) 
ACAAGATATC (SEQ ID NO: 319) 
AAGATATCCT ( SEQ ID NO: 321) 
GATATCCTTG (SEQ ID NO: 323) 
TATOCTTGAT (SEQ ID NO: 325) 
TCCTTGATCT (SEQ ID NO: 327); 
CTTGATCTGT (SEQ ID NO: 329) 
TGATCTGTGG (SEQ ID NO: 331} 
ATCTGTGGAT (SEQ ID NO: 333) 
CTGTGGATCT (SEQ ID NO: 335) 
GTGGATCTAC (SEQ ID NO: 337) 
GGATCTACCA (SEQ ID NO: 339) 
ATCTACCACA (SEQ ID NO: 341) 
CTACCACACA(SEQ ID NO: 343) 
ACCACACACA(SEQ ID NO: 345) ; 
CACACACAAG (SEQ ID NO: 347) 
CACACAAGGC (SEQ ID NO: 349) 
CACAAGGCTA (SEQ ID NO: 351) 
CAAGGCTACT (SEQ ID NO: 353) 
AGGCTACTTC (SEQ ID NO: 355) 
GCTACTTCCC (SEQ ID NO: 357) 
TACTTCCCTG (SEQ ID NO: 359) 
CTTCCCTGAT (SEQ ID NO: 361) 
TCCCTGATTG (SEQ ID NO: 363) 
CCTGATTGGC (SEQ ID NO: 365) 
TGATTGGCAG (SEQ ID NO: 367) 
ATTGGCAGAA (SEQ ID NO: 369) ; 
TGGCAGAACT (SEQ ID NO: 371); 
GCAGAACTAC (SEQ ID NO: 373) 
AGAACTACAC (SEQ ID NO: 375) 
AACTACACAC (SEQ ID NO: 377) 
CTACACACCA(SEQ ID NO: 379) 
ACACACCAGG (SEQ ID NO: 381); 
ACACCAGGGC (SEQ ID NO: 383); 
ACCAGGGCCA (SEQ ID NO: 385) ; 
CAGGGCCAGG (SEQ ID NO: 387); 
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AGGGCCAGGG {SEQ ID NO: 
GGCCAGGGGT (SEQ ID NO: 
CCAGGGGTCA { SEQ ID NO: 
AGGGGTCAGA ( SEQ ID NO: 
5 GGGTCAGATA ( SEQ ID NO: 
GTCAGATATC (SEQ ID NO: 
CAGATATCCA (SEQ ID NO: 
GATATCCACT (SEQ ID NO: 
TATCCACTGA ( SEQ ID NO: 
10 TCCACTGACC(SEQ ID NO: 
GACTGACCTT (SEQ ID NO: 
CTGACCTTTG (SEQ ID NO: 
GACCTTTGGA ( SEQ ID NO: 
CCTTTGGATG ( SEQ ID NO: 
15 TTTGGATGGT (SEQ ID NO: 
TGGATGGTGC (SEQ ID NO: 
GATGGTGCTA (SEQ ID NO: 
TGGTGCTACA (SEQ ID NO: 
GTGCTACAAG (SEQ ID NO: 
20 GCTACAAGCT (SEQ ID NO: 
TACAAGCTAG (SEQ ID NO: 
CAAGCTAGTA ( SEQ ID NO: 
AGCTAGTACC ( SEQ ID NO: 
CTAGTACCAG (SEQ ID NO: 
25 AGTACCAGTT (SEQ ID NO: 
TACCAGTTGA ( SEQ ID NO: 
CCAGTTGAGCfSEQ ID NO: 
AGTTGAGCGA (SEQ ID NO: 
TTGAGCCAGA (SEQ ID NO: 
30 GAGCCAGATA (SEQ ID NO: 
GCCAGATAAG (SEQ ID NO: 
CAGATAAGGT (SEQ ID NO: 
GATAAGGTAG (SEQ ID NO: 
TAAGGTAGAA ( SEQ ID NO: 
35 AGGTAGAAGA (SEQ ID NO: 
GTAGAAGAGG { SEQ ID NO: 
AGAAGAGGCC(SEQ ID NO: 
AAGAGGCCAA(SEQ ID NO: 
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388); GGGCCAGGGG (SEQ ID NO: 389); 
390); GCCAGGGGTC (SEQ ID NO: 391); 
392); CAGGGGTCAG (SEQ ID NO: 393); 
394); GGGGTGAGAT (SEQ ID NO: 395); 
396); GGTCAGATAT (SEQ ID NO: 397); 
398); TCAGATATCC (SEQ ID NO: 399); 
400) ; AGATATCCAC (SEQ ID NO: 401); 
402); ATATCCACTG (SEQ ID NO: 403); 
404); ATCCACTGAC (SEQ ID NO: 405); 
406); CCACTGACCT (SEQ ID NO: 407); 
408); ACTGACCTTT (SEQ ID NO: 409); 
410); TGACCTTTGG (SEQ ID NO: 411); 
412); ACCTTTGGAT (SEQ ID NO: 413); 
414); CTTTGGATGG (SEQ ID NO: 415); 
416); TTGGATGCTG (SEQ ID NO: 417); 
418); GGATGGTGCT (SEQ ID NO: 419); 
420); ATGGTGCTAC (SEQ ID NO: 421); 
422); GGTGCTACAA (SEQ ID NO: 423); 
424); TGCTACAAGC (SEQ ID NO: 425); 
426); CTACAAGCTA (SEQ ID NO: 427); 
428); ACAAGCTAGT (SEQ ID NO: 429); 
430); AAGCTAGTAC (SEQ ID NO: 431); 
432); GCTAGTACCA (SEQ ID NO: 433); 
434); TAGTACCAGT (SEQ ID NO: 435); 
436) ; GTACCAGTTG (SEQ ID NO: 437) ; 
438); ACCAGTTGAG (SEQ ID NO: 439); 
440); CAGTTGAGCC (SEQ ID NO: 441);- 
442); GTTGAGCCAG (SEQ ID NO: 443); 
444); TGAGCCAGAT (SEQ ID NO: 445); 
446); AGCCAGATAA (SEQ ID NO: 447); 
448); CCAGATAAGG (SEQ ID NO: 449); 
450); AGATAAGGTA (SEQ ID NO: 451); 
452); ATAAGGTAGA { SEQ ID NO: 453); 
454); AAGGTAGAAG (SEQ ID NO: 455); 
456); GGTAGAAGAG (SEQ ID NO: 457); 
458); TAGAAGAGGC (SEQ ID NO: 459); 
460); GAAGAGGCCA(SEQ ID NO: 461); 
462); AGAGGCCAAT (SEQ ID NO: 463); 
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GAGGCCAATA (SEQ ID NO: 464! 
GGCCAATAAA (SEQ ID NO : 466 
CCAATAAAGG (SEQ ID NO: 468] 
AATAAAGOAG (SEQ ID NO: 470] 
5 TAAAGGAGAG { SEQ ID NO: 472] 
AAGGAGAGAA (SEQ ID NO: 474] 
GGAGAGAACA (SEQ ID NO: 476] 
AGAGAACACC (SEQ ID NO: 478] 
AGAACACCAG (SEQ ID NO: 480] 

10 AACACCAGCT (SEQ ID NO: 482] 
CACCAGCTTG(SEQ ID NO: 484] 
CCAGCTTGTT (SEQ ID NO: 486! 
A6CTTGTTAC (SEQ ID NO: 488] 
CTTGTTACAC (SEQ ID NO: 490] 

IS TGTTACACCC (SEQ ID NO: 492] 
TTACACCCTG (SEQ ID NO: 494] 
ACACCCTGTG (SEQ ID NO: 496] 
ACCCTGTGAG (SEQ ID NO: 498] 
CCTGTGAGCC (SEQ ID NO: 500] 

20 TGTGAGCCTG (SEQ ID NO: 502] 
TGAGCCTGCA ( SEQ ID NO: 504] 
AGCCTGCATG (SEQ ID NO: 506! 
CCTGCATGGA (SEQ ID NO: 508] 
TGCATGGAAT (SEQ ID NO: 510] 

25 CATGGAATGG ( SEQ ID NO: 512! 
TGGAATGGAT (SEQ ID NO: 514! 
GAATGGATGA (SEQ ID NO: 516! 
ATGGATGACC (SEQ ID NO: 518! 
GGATGACCCT (SEQ ID NO: 520! 

30 ATGACCCTGA (SEQ ID NO: 522 
GACCCTGAGA (SEQ ID NO: 524 
CCCTGAGAGA (SEQ ID NO: 526! 
CTGAGAGAGA(SEQ ID NO: 528 
GAGAGAGAAG(SEQ ID NO: 530! 

35 GAGAGAAGTG (SEQ ID NO: 532 
GAGAAGTGTT (SEQ ID NO: 534 
GAAGTGTTAG ( SEQ ID NO: 536 
AGTGTTAGAG (SEQ ID NO: 538 



; AGGCCAATAA (SEQ 
; GCCAATAAAG (SEQ 
; CAATAAAGGA (SEQ 
; ATAAAGGAGA (SEQ 
; AAAGGAGAGA (SEQ 
; AGGAGAGAAC (SEQ 
; GAGAGAACAC (SEQ 
; GAGAACACCA ( SEQ 
; GAACACCAGC (SEQ 
; ACACCAGCTT (SEQ 
; ACCAGCTTGT (SEQ 
; CAGCTTGTTA (SEQ 
; GCTTGTTACA (SEQ 
; TTGTTACACC (SEQ 
; GTTACACCCT (SEQ 
f TACACCCTGT (SEQ 
; CACCCTGTGA (SEQ 
; CCCTGTGAGC (SEQ 
; CTGTGAGCCT (SEQ 
; GTGAGCCTGC (SEQ 
; GAGCCTGCAT (SEQ 
; GCCTGCATGG (SEQ 
; CTGCATGGAA ( SEQ 
; GCATGGAATG ( SEQ 
; ATGGAATGGA (SEQ 
; GGAATGGATG ( SEQ 
; AATGGATGAC (SEQ 
; TGGATGACCC (SEQ 
; GATGACCCTG (SEQ 
; TGACCCTGAG (SEQ 
; ACCCTGAGAG ( SEQ 
; CCTGAGAGAG (SEQ 
; TGAGAGAGAA(SEQ 
; AGAGAGAAGT (SEQ 
; AGAGAAGTGT (SEQ 
; AGAAGTGTTA ( SEQ 
; AAGTGTTAGA ( SEQ 
; GTGTTAGAGT (SEQ 



ID 


NO: 


465) ; 


ID 


NO: 


467) ; 


ID 


NO: 


469) ; 


ID 


NO: 


471) ; 


ID 


NO: 


473) ; 


ID 


NO: 


475) ; 


ID 


NO: 


477) ; 


ID 


NO: 


479; ; 


ID 


NO: 


481; ; 


ID 


NO: 


483} ; 


ID 


NO: 


485) ; 


ID 


NO: 


487) ; 


ID 


NO: 


489) ; 


ID 


NO: 


491) ; 


ID 


NO: 


493) ; 


ID 


NO: 


495} ; 


ID NO: 


497) ; 


ID 


NO: 


499} ; 


ID 


NO: 


501) ; 


ID 


NO: 


503) ; 


ID 


NO: 


505) ; 


ID 


NO: 


507) ; 


ID 


NO: 


509) ; 


ID 


NO: 


511) ; 


ID 


NO: 


513) ; 


ID 


NO: 


515) ; 


ID 


NO: 


517) ; 


ID 


NO: 


519) ; 


ID 


NO: 


521} ; 


ID 


NO: 


523) ; 


ID 


NO: 


525) ; 


ID 


NO: 


527) ; 


ID 


NO: 


529) ; 


ID 


NO: 


531) ; 


ID 


NO: 


533) ; 


ID 


NO: 


535) ; 


ID 


NO: 


537) ; 


ID 


NO: 


539) ; 
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TGTTAGAGTG (SEQ ID NO: 540) 

TTAGAGTGGA (SEQ ID NO: 542) 

AGAGTGGAGG (SEQ ID NO: 544), 

AGTGGAGGTT (SEQ ID NO: 546); 
5 TGGAGGTTTG (SEQ ID NO: 548); 

GAGGTTTGAC (SEQ ID NO: 550) ; 

GGTTTGACAG (SEQ ID NO: 552); 

TTTGACAGCC (SEQ ID NO: 554); 

TGACAGCCGC (SEQ ID NO: 556); 
10 ACAGCCGCCT (SEQ ID NO: 558); 

AGCCGCCTAG (SEQ ID NO: 560); 

C06CCTA0CA(SEQ ID NO: 562); 

GCCTAGCATT (SEQ ID NO: 564); 

CTAOCATTTC (SEQ ID NO: 566); 
IS AGCATTTCAT (SEQ ID NO: 568); 

CATTTCATCA ( SEQ ID NO: 570); 
TTTCATCACG (SEQ ID NO: 572) ; 
TCATCAOGTG (SEQ ID NO: 574); 
ATCAOGTGGC ( SEQ ID NO: 576); 
20 CACGTGGCCC(SEQ ID NO: 578) ; 
OGTGGCCCGA (SEQ ID NO: 580) ; 
TGGCCCGAGA ( SEQ ID NO: 582) ; 
6CCCGAGAGC(SEQ ID NO: 584); 
CCGAGAGCTG (SEQ ID NO: 586) ; 
25 GAGAGCTGCA ( SEQ ID NO: 588) ; 
GAGCTGCATC (SEQ ID NO: 590); 
GCTGCATCCG (SEQ ID NO: 592) ; 
TGCATCCGGA (SEQ ID NO: 594) ; 
CATCCGGAGT (SEQ ID NO: 596); 
30 TCCGGAGTAC ( SEQ ID NO: 598); 
CGGAGTACTT(SEQ ID NO: 600); 
GAGTACTTCA ( SEQ ID NO: 602) ; 
GTACTTCAAG (SEQ ID NO: 604) ; 
ACTTCAAGAA(SEQ ID NO: 606) ; 
35 TTCAAGAACT (SEQ ID NO: 608); 
CAAGAACTGC (SEQ ID NO: 610); 
AGAACTGCTG (SEQ ID NO: 612); 



PCT/AU95/00063 

-24- 

GTTAGAGTGG ( SEQ ID NO: 541) ; 

TAGAGT6GAG ( SEQ ID NO: 543); 

GAGTGGAGGT (SEQ ID NO: 545); 

GTGGAGGTTT (SEQ ID NO: 547); 

GGAGGTTTGA(SEQ ID NO: 549); 

AGGTTTGACA ( SEQ ID NO: 551) ; 

GTTTGACAGC (SEQ ID NO: 553); 

TTGACAGCCG (SEQ ID NO: 555) ; 

GACAGCCGCC(SEQ ID NO: 557); 

CAGCCGCCTA(SEQ ID NO: 559) ; 

GCCGCCTAGC (SEQ ID NO: 561); 

CGCCTAGCAT (SEQ ID NO: 563); 

CCTAGCATTT (SEQ ID NO: 565); 

TAGCATTTCA ( SEQ ID NO: 567); 

GCATTTCATC (SEQ ID NO: 569); 

ATTTCATCAC (SEQ ID NO: 571} ; 

TTCATCACGT (SEQ ID NO: 573); 

CATCACGTGG (SEQ ID NO: 575); 
TCACOTGGCC(SEQ ID NO: 577); 
ACGTGGCCCG (SEQ ID NO: 579); 
GTGGCCCGAG (SEQ ID NO: 581); 
GGCCCGAGAG (SEQ ID NO: 583); 
CCCGAGAGCT ( SEQ ID NO: 585); 
CGAGAGCTGC (SEQ ID NO: 587); 
AGAGCTGCAT (SEQ ID NO: 589) ; 
AGCTGCATCC (SEQ ID NO: 591); 
CTGCATCCGG (SEQ ID NO: 593); 
GCATCOGGAG (SEQ ID NO: 595) ; 
ATCCGGAGTA (SEQ ID NO: 597) ; 
CCGGAGTACT (SEQ ID NO: 599); 
GGAGTACTTC (SEQ ID NO: 601); 
AGTACTTCAA ( SEQ ID NO: 603) ; 
TACTTCAAGA (SEQ ID NO: 605); 
CTTCAAGAAC (SEQ ID NO: 607); 
TCAAGAACTG (SEQ ID NO: 609) ; 
AAGAACTGCT ( SEQ ID NO: 611) ; 
GAACTGCTGA ( SEQ ID NO: 613) . 
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Generally, the subject HIV-1 isolate is non-pathogenic as hereinbefore defined. 
Additionally, reference herein to n a deletion" includes reference to a contiguous or non- 
contiguous series of two or more deletions. 

5 The non-pathogenic isolate may carry a single decanucleotide deletion or may cany 
more than one decanucleotide deletion. Where it carries multiple deletions these may 
all correspond to a contiguous sequence or be from different parts of the nef gene. 
Furthermore, the terminal end portions of a deletion may lie within a decanucleotide as 
defined above. It is emphasised that the present invention extends to analogous 
10 sequences from other pathogenic strains of HIV-1 which might cany nef genes with a 
slightly altered sequence relative to HIV-1 strain NL4-3. 

In a most preferred embodiment of the present invention, there is provided a non- 
pathogenic strain of HIV-1 carrying a nucleotide sequence in its genome as set forth 
15 in SEQ ID No. 614: 

GAAGAGATTTGG^GAACATGACCT 

aaaatacatatactccttact^^ 
aactattggaattggatcaatgggcaatf 
ctgtggtatataaaaatattcataatgg 
20 ttttgctgtactttctatagtgaa™ 
agaccctcctcccaaccco»ggggac 
ggagagagagacagagacagatccactcgattagt 

GGACGACCTGAGGAGCCTGTGC 
TTGTAACAAGGATTGTGGAACTTCTGG 
25 TGGAACCTCCTAAAGTATTGGAGCCAGKxAACTGGAGAAG^ 
TGCCACCGCCATAGCAGTAGCTGAGGGGACAG^ 
CTTATAGAGCTATCCTCC31CATACCT 

CTATAAAATGG6TGGCAAGTQAGCAAAAAGTAGTGTAGTCAGATAG 

GGGTGGGGGCCAACAACTAACAATGCTGATCGTC 
30 AGAAGCGGGTTTTCCAGTCAAACCTCAGGTAGCTG^ 

AAAAGGGGGGACTGGAAGGGCTAATTCACTC^ 

AAACTATTACCAGTGGAGTGAGCGAAGATA 

ATTGTTCTGTTGGGGACTTTCCATCCGTTGGGGA 

GACTAGTTCCGGTGGGGACTTTCCAAG 
35 GAGCCCTCAGATGCTGCATATAAGCAGCTG CTTTCTGCTCTTACTGGGTCTCTCGGGTT 
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AGACCAGATCTGAGCCTGGGAGCTCTCTGGCTAACTAG^ 

AATAAAGCTTGCCTTGAGTGCTTCAAGTA£^ 

ATCTAGA; 

5 and/or SEQ ID NO: 615: 

GAAACAATTTGGGATAACATGACCTGGATGC^ 

AAAGATAATATACACCTTAATTGAAGAATC& 

AATTATTGGAATTGGATAAATGG^ 

CTATGGTATATAAAATTATTCATAATGGTAGT^ 
10 TTTTACTQTACTTTCTATAGTGAATAGAGTTAGGCA 

AGACCCACCTCCCAACCCCGAAGGGACCC<^ 

GGAGAGAGAGACAGAGGCAGCTCCACTCGATTAGTGCA 

GGACGACX!TGAGGAGTCTGTGCCTCTT 

TTGTAACGAGGATTGTGGAACTTCTGGGACGC^ 
15 TGGAATCTCCTGCAGTATTGGAGGCAGGAACTACAGAA 

TGGCACGGCCATAGGAGTAGCTGAGGGGACAGAT 

CTTATAGAGCTATTCTCC^CATACCTAGAAGAATAA 

CTATAAAATGGGTGGCAAGTGGTCAGAAAGTAGTO^ 

GACAAGGCAGCTATAGATCTTAGCCGCTTTT^ 
20 AATTCACTCACAGAGAAGATCAGTTGAACC^^ 

AAAACAACAGATTGTT CCGTTTGTTCCGTTGGGGACTTTCC^ CTGAGT 

GATAAGCCGCTGGGGACTTTCCGAAGAGGCGTGACGGGACTT^ 

TGGGCGGGACTGGGGAGTGGCGAGCCCTCAGATC 

CTGTACTGGGTCTCTCTGGTTAGACCAGATCT 
25 GGAACCCACTGCTTAAGCCTCAATAAAGCTTGC 

CGTCTGTTGTGTGACTCTGGTATCTAGA - 

The present invention, however, extends to HTV-1 isolates which are non-pathogenic; 
cany genomes capable of hybridising undo: low stringency conditions to SEQ ID NO: 
30 6 1 4 or SEQ ID NO: 61 5; and which do not direct the synthesis of a ftdl length nef gene 
product. 

hx a further embodiment the present invention contemplates a viral isolate which: 

(i) is reactive to antibodies to a glycoprotein from HTV-1 such as at least one of 
35 gp41-45> gpl20 and/or gp!60; 

(ii) carries a genome or a part or fragment thereof capable of hybridising under 
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medium stringency conditions to a nucleotide sequence as set forth in SEQ ID 
NO: 1 or a complementary form thereof or an analogous sequence from another 
pathogenic strain of HTV-1 ; 

Gix) carries a deletion of at least ten nucleotides in a region corresponding to the LTR 
5 region in HTV-1 NL4-3; and 

wherein said deletion encompasses one or more of the following decanucleotides from 

the LTR region of HTV-1 NL4-3 or corresponding sequences from another pathogenic 

strain of HTV-1: 



10 


GCTTTTTGCC 


(SEQ 


ID 


NO: 


652) ; 


CTTTTTGCCT 


(SEQ ID NO: 


653) ; 




TTTTTGCCTG 


(SEQ 


ID 


NOi 


654) i 


> TTTTGCCTGT 


(SEQ ID NO: 


655) ; 




TTTGCCTGTA 


(SEQ 


ID 


NO: 


656) j 


TTGCCTGTAC 


(SEQ ID NO: 


657) ; 




T6CCTGTACT 


(SEQ 


ID 


NO: 


658) j 


GCCTGTACTG 


(SEQ ID NO: 


659) ; 




CCTGTACTGG 


(SEQ 


ID 


NO: 


660) ; 


CTGTACTGGG 


(SEQ ID NO: 


661) ; 


15 


TGTACTGGGT 


(SEQ 


ID 


NO: 


662) j 


GTACTGGGTC 


(SEQ ID NO: 


663) ; 




TACTGGGTCT 


(SEQ 


ID 


NO: 


664) ; 


• ACTGGGTCTC 


(SEQ ID NO: 


665) ; 




CTGGGTCTCT 


(SEQ 


ID 


NO: 


666) ; 


TGGGTCTCTC 


(SEQ ID NO: 


667) / 




GGGTCTCTCT 


(SEQ 


ID 


NO: 


668) j 


GGTCTCTCTG 


(SEQ ID NO: 


669 ) ; 




GTCTCTCTGG 


(SEQ 


ID 


NO: 


670) ; 


TCTCTCTGGT 


(SEQ ID NO: 


671) / 


20 


CTCTCTGGTT 


(SEQ 


ID 


NO: 


672) , 


TCTCTGGTTA 


(SEQ ID NO: 


673) ; 




CTCTGGTTAG 


(SEQ 


ID 


NO: 


674) , 


■ TCTCTGGTTA 


(SEQ ID NO: 


675) ; 




CTGGTTAGAC 


(SEQ 


ID 


NO: 


676) , 


• TGGTTAGACC 


(SEQ ID NO: 


677) ; 




GGTTAGACCA 


(SEQ 


ID 


NO: 


678) , 


■ GTTAGACCAG 


(SEQ ID NO: 


679) ; 




TTAGACCAGA 


(SEQ 


ID 


NO: 


680) , 


; TAGACCAGAT 


(SEQ ID NO: 


681) ; 


25 


AGACCAGATC 


(SEQ 


ID 


NO: 


682) 


GACCAGATCT 


(SEQ ID NO: 


683) ; 




ACCAGATCTG 


(SEQ 


ID 


NO: 


684) 


. CCAGATCTGA 


(SEQ ID NO: 


685) ; 




CAGATCTGAG 


(SEQ 


ID 


NO: 


686) ; AGATCTGAGC 


(SEQ ID NO: 


687) ; 




GATCTGAGCC 


(SEQ 


ID 


NO: 


688) ; ATCTGAGCCT 


(SEQ ID NO: 


689) ; 




TCTGAGCCTG 


(SEQ 


ID 


NO: 


690) CTGAGCCTGG 


(SEQ ID NO: 


691) ; 


30 


TGAGCCTGGG 


(SEQ 


ID 


NO: 


692) ; GAGCCTGGGA 


(SEQ ID NO: 


693) ; 




AGCCTGGGAG 


(SEQ 


ID 


NO: 


694) 


, GCCTGGGAGC 


(SEQ ID NO: 


695) ; 




CCTGGGAGCT 


(SEQ 


ID 


NO: 


696) ; CTGGGAGCTC 


(SEQ ID NO: 


697) ; 




TGGGAGCTCT 


(SEQ 


ID 


NO: 


698) ; GGGAGCTCTC 


(SEQ ID NO: 


699); 




GGAGCTCTCT 


(SEQ 


ID 


NO: 


700) 


i GAGCTCTCTG 


(SEQ ID NO: 


701) 


35 


AGCTCTCTGG 


(SEQ 


ID 


NO: 


702) 


; GCTCTCTGGC 


(SEQ ID NO: 


703) ; 




CTCTCTGGCT 


(SEQ 


ID 


NO: 


704) 


; TCTCTGGCTA 


(SEQ ID NO: 


705) ; 
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CTCTGGCTAA {SEQ ID NO: 706); 
CTGGCTAACT {SEQ ID NO: 708); 
GGCTAACTAG (SEQ ID NO: 710); 
CTAACTAGGG (SEQ ID NO: 712); 
5 AACTAGGGAA (SEQ ID NO: 714) ; 
CTAGGGAACC (SEQ ID NO: 716); 
AGGGAACCCA (SEQ ID NO: 718) ; 
GGAACCCACT (SEQ ID NO: 720) ; 
AACCCACTGC (SEQ ID NO: 722); 
10 CCCACTGCTT (SEQ ID NO: 724); 
CACTGCTTAA (SEQ ID NO: 726); 
CTGCTTAAGC (SEQ ID NO: 728); 
GCTTAAGCCT (SEQ ID NO: 730); 
TTAAGCCTCA (SEQ ID NO: 732) ; 
15 AAGCCTCAAT (SEQ ID NO: 734); 
GCCTC&ATAA (SEQ ID NO: 736) ; 
CTCAATAAAG (SEQ ID NO: 738); 
CAATAAAGCT (SEQ ID NO: 740) ; 
ATAAAGCTTG (SEQ ID NO: 742); 
20 AAAGCTTGCC (SEQ ID NO: 744); 
AGCTTGCCTT (SEQ ID NO: 746); 
CTTGCCTTGA (SEQ ID NO: 748); 
TGCCTTGAGT (SEQ ID NO: 750); 
CCTTGAGTGC (SEQ ID NO: 752); 
25 TTGAGTGCTT (SEQ ID NO: 754) ; 
GAGTGCTTCA (SEQ ID NO: 756) ; 
GTGCTTCAAG (SEQ ID NO: 758) ; 
GCTTCAAGTA (SEQ ID NO: 760); 
TTCAAGTAGT (SEQ ID NO: 762) ; 
30 CAAGTAGTGT (SEQ ID NO: 764); 
AGTAGTGTGT (SEQ ID NO: 766); 
TAGTQTGTGC (SEQ ID NO: 768); 
GTGTGTQCCC (SEQ ID NO: 770) ; 
GTGTGCCCGT (SEQ ID NO: 772); 
35 GTGCCCGTCT (SEQ ID NO: 774); 
GCCCGTCTGT (SEQ ID NO: 776); 
CCGTCTGTTG (SEQ ID NO: 778) ; 
GTCTGTTGTG (SEQ ID NO: 780); 
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TCTGGCTAAC (SEQ ID NO: 707); 

TGGCTAACTA (SEQ ID NO: 709); 

GCTAACTAGG (SEQ ID NO: 711) ; 

TAACTAGGGA (SEQ ID NO: 713); 

ACTAGGGAAC (SEQ ID NO: 715); 

TAGGGAACCC (SEQ ID NO: 717) ; 

GGGAACCCAC (SEQ ID NO: 719); 

GAACCCACTG (SEQ ID NO: 721); 

ACCCACTGCT (SEQ ID NO: 723); 

CCACTGCTTA (SEQ ID NO: 725); 

ACTGCTTAAG (SEQ ID NO: 727); 

TGCTTAAGCC (SEQ ID NO: 729) ; 

CTTAAGCCTC (SEQ ID NO: 731); 

TAAGCCTCAA (SEQ ID NO: 733); 
AGCCTCAATA (SEQ ID NO: 735); 

CCTCAATAAA (SEQ ID NO: 737) ; 

TCAATAAAGC (SEQ ID NO: 739); 
AATAAAGCTT (SEQ ID NO: 741); 
TAAAGCTTGC (SEQ ID NO: 743); 
AAGCTTGCCT (SEQ ID NO: 745); 
GCTTGCCTTG (SEQ ID NO: 747); 
TTGCCTTGAG (SEQ ID NO: 749) ; 
GCCTTGAGTG (SEQ ID NO: 751); 
CTTGAGTGCT (SEQ ID NO: 753); 
TGAGTGCTTC (SEQ ID NO; 755); 
AGTGCTTCAA (SEQ ID NO: 757); 
TGCTTCAAGT (SEQ ID NO: 759); 
CTTCAAGTAG (SEQ ID NO: 761); 
TCAAGTAGTG (SEQ ID NO: 763); 
AAGTAGTGTG (SEQ ID NO: 765) ; 
GTAGTGTGTG (SEQ ID NO: 767); 
AGTGTGTGCC (SEQ ID NO: 769); 
TGTGTGCCCG (SEQ ID NO: 771) ; 
TGTGCCCGTC (SEQ ID NO: 773); 
TGCCCGTCTG (SEQ ID NO: 775) ; 
CCCGTCTGTT (SEQ ID NO: 777) ; 
CGTCTGTTGT (SEQ ID NO: 779); 
TCTGTTGTGT (SEQ ID NO: 781); 
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CTGTTGTQTO (SEQ ID NO: 782); TGTTGTGTGA {SEQ ID NO: 783); 
GTTGTGTGAC (SEQ ID NO: 784); TTGTGTGACT (SEQ ID NO: 785); 
TGTGTGACTC (SEQ ID NO: 786); GTGTGACTCT (SEQ ID NO: 787); 
TGTGTGACTC (SEQ ID NO: 788); GTGTGACTCT (SEQ ID NO: 789); 
5 TGTGACTCTG (SEQ ID NO: 790) ; GTGACTCTGG (SEQ ID NO: 791) ; 
TGACTCTGGT (SEQ ID NO: 792); GACTCTGGTA (SEQ ID NO: 793); 
ACTCTGGTAA (SEQ ID NO: 794); CTCTGGTAAC (SEQ ID NO: 795); 
TCTGGTAACT (SEQ ID NO: 796); CTGGTAACTA (SEQ ID NO: 797); 
TGGTAACTAG (SEQ ID NO: 798); GGTAACTAGA (SEQ ID NO: 799). 

10 

The non-pathogenic isolate may cany a angle decanucleotide deletion in the LTR region 
or may carry multiple deletions in the same region or in the LTR region and another 
region such as the nef gene. In particular, the mutation may be in the LTR/nef overlap 
region. Where it carries multiple deletions, these may correspond to a contiguous 
15 sequence or be from different parts of the LTR region and/or nef gene. Furthermore, 
the terminal end portions of a deletion may lie within a decanucleotide as defined above. 

Yet another aspect of the present invention provides an infectious molecular clone 
comprising genetic sequences derived from the non-pathogenic HTV-I isolates as 
20 hereinbefore described and includes genetic sequences encoding major structural proteins 
such as gag em and pol Infectious molecular clones are particularly useful as genetic 
compositions capable of "infecting" host cells without need of viral coat The infectious 
molecular clones of the present invention may also be derived from pathogenic HTV-1 
strains rendered non-pathogenic as hereindescribed 

25 

According to this latter embodiment, there is contemplated a method of attenuating a 
pathogenic strain of HIV- 1 , said method comprising inducing a mutation in the nef gene 
and/or an LTR region to generate a nonpathogenic HIV-1 strain as hereinbefore 
described Preferred mutations are deletions of at least ten nucleotides such as one or 
30 more of the decanucleotides as hereinbefore described The mutations may also 
constitute substitutions and/or insertions of heterologous nucleic acid molecules in the 
nef and/or LTR regions such as die incorporation of a sense (Le. co-suppression) 
antisense or ribozyme molecule (see below). 
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In still yet another embodiment of the present invention, there is provided an isolated, 
non-pathogenic strain of HTV-1 comprising a deletion in its genome of at least 10 
nucleotide within the region of nucleotide 8787 and 9709 and more particularly within 
the region 8800 and 9700 and even more preferably within the region 8800 and 9410, 
5 using the nucleotide numbering of HTV-1 NL4-3. 

In one embodiment, the deletion of at least 10 nucleotides is from within a region 
selected from the list consisting of (using the nucleotide numbers of HTV-1 NL4-3): 
nucleotide (i) 8830-8862; 
10 (ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

In another embodiment, the deletion of at least 10 nucleotides is from within a region 
IS selected from the list consisting of (using the nucleotide numbers of HIV- 1 NL4-3): 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

20 

In yet another embodiment, the deletion of at least 10 nucleotides is from within a 
region selected from the list consisting of (using the nucleotide numbers of HTV-1 NL4- 
3): 

nucleotide (ix) 8882-8928; 
25 (x) 8850-9006; 

(xQ 8792-9041; and 

(xii) 9112-9204. 



30 



In still yet another emrxxliment, the deletion of at least 10 nucleotides is from within a 
region selected from the list consisting of (using the nucleotide numbers of HTV-1 NL4- 
3): 
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nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

5 

. The above embodiments including any combinations thereof or functional equivalents 
thereof. Most preferred variants of HTV-1 axe defined in Table 3. 

Reference herein to the non-pathogenic HIV-1 strains of the present invention includes 
10 reference to components, parts, fragments and derivatives thereof including both genetic 
and non-genetic material. Furthermore, the non-pathogenic HIV-1 strains may be in 
isolated form or resident in peripheral blood mononuclear cells (PBMCs) or other like 
cells where the genome of the HIV-1 strains is integrated as DNA from said HIV-1 
strains such as proviral DNA. In addition, the present invention extends to recombinant 
IS virus such as from (or resident in) prokaryotes or eukaiyotes as well as in the form of 
infectious molecular clones. 

Accordingly, the present invention provides for the non-pathogenic HIV-1 isolate, 
genomic material therefrom, complementary proviral DNA, molecular infectious clones, 
20 recombinant viral particles or genetic sequences therefrom or cells expressing same or 
blood cells carrying proviral DNA or to any mutants, derivatives, components, 
fragments, parts, homologues or analogues of the foregoing. 

The non-pathogenic HIV-1 strains of the present invention are particularly useful in the 
25 development of therapeutic compositions, therapeutic molecules and/or diagnostic 
reagents. With regards to the former, the non-pathogenic HIV-1 strain may be 
considered as a live attenuated vaccine where individuals carrying DNA derived from 
said non-pathogenic HIV-1 strain such as proviral DNA in target cells are protected from 
infection by a corresponding pathogenic strain. The term "vaccine" is used in its broadest 
30 sense as a therapeutic composition or molecule which prevents or reduces HTV-1 
infection or risk of infection or which ameliorates die symptoms of infection. It may 
involve the stimulation of an immune response or may involve blocking HTV-1 cells 
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receptors and/or the use of genetic compositions, for example, to introduce ribozymes 
or antisense molecules to HTV-1 directed genetic sequences or to prepare infectious 
molecular clones. For convenience, all such compositions are referred hereinafter to 
"therapeutic compositions''. 

5 

Accordingly, the present invention contemplates a method for inhibiting or reducing the 
risk of infection by a pathogenic strain of HTV-1, said method comprising administering 
to a subject a non-pathogenic HIV-1 as hereinbefore defined in an amount effective to 
infect target cells and to generate target cells carrying proviral DNA from said non- 
10 pathogenic HTV-1. More particularly, the present invention contemplates a method for 
inhibiting or reducing productive infection of an individual by a pathogenic strain of 
HIV-1, said method comprising administering to a subject a non-pathogenic isolate of 
HTV-1 in an amount effective to infect target cells and to generate target cells carrying 
proviral DNA from said non-pathogenic HTV-1. By "productive infection" as used in 
15 die specification and claims herein is meant the infection of a cell or cells by a 
pathogenic strain of HTV-1 which leads ultimately to the symptomology of AIDS or 
AIDS related diseases. A cell infected productively produces pathogenic virions. By 
definition, infection of an individual by a non-pathogenic strain of HIV-1 would not lead 
to productive infection. Non pathogenic HIV-1 strains generally replicate to a sufficient 
20 extent to protect cells against challenge by virulent or pathogenic strains. The methods 
of the present invention are applicable prophylactically (Le. to prevent de novo infection) 
or therapeutically (Le. to reduce or slow disease progression). 

The present invention further provides a method for vaccinating an individual against 
25 the development of AIDS or AIDS related diseases, said method comprising 
administering to said individual a non-pathogenic isolate of HIV-1 in an amount 
effective to infect target cells and to generate target cells carrying proviral DNA from 
said non-pathogenic HIV-1. The tern "vaccinating" should not be taken as limiting the 
invention to the prevention of HTV-1 infection by solely immunological m«mg *xhe 
30 term "vaccinating 11 includes any means of preventing productive infection of an 
individual by pathogenic HTV-1. 



WO 95/21912 PCT/AU9S/00063 

-33- 

As an alternative to the above methods, a therapeutic composition as hereinbefore 
defined is administered The non-pathogenic isolate may be administered inter alia as 
an isolated viral preparation or via infected blood cells. Another aspect of the invention 
provides a therapeutic composition for inhibiting or reducing the risk of infection by a 
5 pathogenic strain of HIV-1 said therapeutic composition comprising a non-pathogenic 
strain of HTV-1 or genetic sequences derived therefrom as hereinbefore described and 
optionally one or more pharmaceutical^ acceptable carriers and/or diluents. The 
therapeutic composition of the present invention is generally suitable for intravenous, 
intraperitoneal, intramuscular, intramucosal (e.g. nasal spray, respiratory spray) or other 

10 forms of parenteral administration. The therapeutic composition might also be 
administered via an implant or rectally or orally. In addition to the mutations 
contemplated above, the non-pathogenic HIV-1 strain may also contain one or more 
other mutations to further reduce the risk of reversion to virulence and/or to insert a 
genetic sequence capable of providing directly or indirectly an identifiable signal, having 

IS further antMflV-1 properties and/or immunostimulatoty or cell regulatory properties. 

For example, the non-pathogenic HIV-1 isolate in the therapeutic composition may 
comprise additional genetic material capable of directing expression of antisense 
nucleotide sequences to inhibit expression of one or more proteins encoded by a 
20 pathogenic strain of HIV-1. Alternatively, sense co-suppression may be employed. 
Preferred sense or flntiggnse molecules would reduce expression of the nef gene or affect 
normal functioning of the LTR region. 

According to this embodiment, the non-pathogenic HTV-1 strain may be considered as 
25 a targeting agent to introduce genetic constructs capable of reducing expression of one 
or more HTV-1 proteins or polypeptides. In this embodiment there is provided a viral 
isolate which: 

(i) is genetically or immunologically related to a pathogenic strain of HTV-1; 

(ii) is substantially non-pathogenic in human subjects; 

30 (iii) comprises a first nucleotide sequence constituting its genome which is capable 
of hybridising under medium stringency conditions to SEQ ID NO: 1 or a 
complementary form thereof; and 
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(iv) comprises a second nucleotide sequence within said first nucleotide sequence and 
which second nucleotide sequence directs expression of a mRNA molecule 
capable of inhibiting, reducing or otherwise down-regulating translation of a 
protein or polypeptide encoded by a pathogenic strain ofHTV-1 or inhibit, reduce 
5 or otherwise down regulate operation of a non-protein encoding a region of a 

pathogenic strain of HTV-L 

Preferred proteins or polypeptides targeted for reduced expression are those encoded by 
one or more ofthe following: gag pol, env, tat, rev, vpu, ypr, w/and/or nefgenes. The 

1 0 most preferred protein or polypeptide targeted for reduced expression is the product of 
the nef gene. Alternatively, the target nucleotide sequence does not encode a 
polypeptide or protein but is required for other functions such as integration or excision 
from the human genome or expression of genes on the viral genome. An example of 
such a nucleotide sequence is the LTR region. Accordingly, the present invention 

IS extends to disruption to the function of such regions. 

The above aspect relates to use of antisense technology. The present invention extends, 
however, to use of ribozymes and/or co-suppression to achieve the same results. In an 
alternative embodiment, or in addition to, the second (or optionally a third) nucleotide 
20 sequence encodes an antiviral agent (e.g. interferon) and/or an immune enhancing agent 

The identification of deletions inter alia in the nef gene and/or LTR region in 
asymptomatic subjects provides a unique opportunity to study the in vivo effects of 
attenuated HTV-1 strains carrying one or more mutations in selected genetic regions, in 

25 particular, the present invention provides a means for designing therapeutic compositions 
directed to inhibiting expression of a nef gene and/or LTR region in a pathogenic HIV- 1 
strain (such as contemplated above) as well as developing a therapeutic reghnem aimed 
at inhibiting the activity of die nef gene product for the function of the LTR region. 
According to this latter embodiment, the presort invention provides a therapeutic 

30 composition comprising a molecule capable of inhibiting the intracellular activity ofthe 
nef gene product and/or LTR region, said composition further comprising one or more 
phannaceutically acceptable carriers and/or diluents. 



WO 95/21912 PCT7AU95/00063 

-35- 

The molecule contemplated by the above aspect of the subject invention may be a 
protein, polypeptide, peptide, chemical compound, sugar moiety or derivative of the nef 
gene product The molecule will need to be capable of entering an infected cell. Where 
the molecule is a protein, polypeptide or peptide, it may be encoded by the second 
5 nucleotide sequence on the targeting vector as contemplated above. Alternatively, the 
molecule may be a nucleic acid molecule capable of targeting the nef gene or LTR 
region. 

The deletion mutants of the present invention may result in a modified nef gene product 
10 either having no readily dxscernable activity or having activity different to the naturally 
occurring nef protein. In any event, if a mutant nef gene product is produced, it will 
generally have a lower molecular weight than the naturally occurring n^f protein and will 
have a different overall amino acid sequence. This provides, therefore, for a means for 
diagnosing individuals with benign HTV-1 infection by, for example, assaying for a 
IS modified nef protein or screening for a modified nef gene sequence. Alternatively, 
benign HTV-1 infection may be detected by assaying for a modified LTR region such 
as an altered nucleotide sequence. 

According to one embodiment, there is provided a method for determining the 
20 pathogenicity of an HIV-1 strain after said HIV-1 strain infects cells of an individual, 
said method comprising contacting a biological sample from said individual with an 
effective amount of an antibody specific to a nef protein from a non-pathogenic strain 
of HIV-1 (as hereinbefore defined) for a time and under conditions sufficient to form 
an antibody-modified nef protein complex and then detecting said complex. The 
25 presence of said complex is indicative of a modified-*^ gene product and of the non- 
pafhogenicity of the strain of HIV-1. The biological sample is a sample likely to contain 
the modified nef gene product such as tissue extract or cell extract of an infected cell. 
However, where the modified nef gene product is capable of permeation or transport out 
of the cell, suitable biological fluid would include serum, whole blood/ lymph and 
30 mucosal secretion amongst other fluids. Many variations in the subject assay are 
possible and are contemplated herein. For example, an assay could be based on the 
inability for a nef specific antibody to bind to a modified nef protein. For the purposes 
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of the present invention the term "contacting" including "mixing". 



The presence of a modified /^/molecule in biological fluid can be detected using a wide 
range of immunoassay techniques such as those described in US Patent Nos. 4,016,043, 
5 4,424,279 and 4,018,653. These include both single-site and two-site, or "sandwich", 
assays of the non-competitive types, as well as in the traditional competitive binding 
assays and include ELISA and RIA techniques. Sandwich assays are among the most 
useful and commonly used assays and are favoured for use in the present invention, A 
number of variations of the sandwich assay technique exist, and all are intended to be 

10 encompassed by the present invention. Briefly, and by way of example only, in a 
typical forward assay, a modified nef product-specific antibody is immobilised onto a 
solid substrate to form a first complex and the sample to be tested for modified nef 
product brought into contact with the bound molecule. After a suitable period of 
incubation, for a period of time sufficient to allow formation of an antibody-modified 

15 nef product secondary complex, a second modified nef protein antibody, labelled with 
a reporter molecule capable of producing a detectable signal, is then added and 
incubated, allowing time sufficient for the formation of a tertiary complex of antibody- 
modified nef product-antibody. Any unreacted material is washed away, and the 
presence of the tertiary complex is determined by observation of a signal produced by 

20 the reporter molecule. The results may either be qualitative, by simple observation of 
the visible signal or may be quantitated by comparison with a control sample containing 
known amounts of hapten. Variations of the forward assay include a simultaneous assay, 
in which both sample and labelled antibody are added simultaneously to the bound 
antibody, or a reverse assay in which the labelled antibody and sample to be tested are 

25 first combined, incubated and then added simultaneously to the bound antibody. These 
techniques are well known to those skilled in the art, and the possibility of minor 
variations will be readily apparent The antibodies used above may be monoclonal or 
polyclonal. 

30 The solid substrate is typically glass or a" polymer, the most commonly used polymers 
being cellulose, polyaaylamide, nylon, polystyrene, polyvinyl chloride or polypropylene. 
The solid supports may be in the form of tubes, beads, discs or mkroplates, or any other 
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surface suitable for conducting an immunoassay. The binding processes are well-known 
in the art and generally consist of cross-linking covalently binding or physically 
adsorbing the molecule to the insoluble carrier. 

5 By "reporter molecule", as used in the present specification, is meant a molecule which, 
by its chemical nature, produces an analytically identifiable signal which allows die 
detection of antigen-bound antibody. Detection may be either qualitative or quantitative. 
The most commonly used reporter molecule in this type of assay re either enzymes, 
fluorophores or radionuclide containing molecules (i.e. radioisotopes). In die case of an 

10 enzyme immunoassay, an enzyme is conjugated to the second antibody, generally by 
means of ghitaraldehyde or periodate. As will be readily recognised, however, a wide 
variety of different conjugation techniques exist which are readily available to one 
skilled in the art Commonly used enzymes include horseradish peroxidase, glucose 
oxidase, p-galactosidase and alkaline phosphatase, amongst others. The substrates to be 

IS used with die specific enzymes are generally chosen for the production p upon hydrolysis 
by the corresponding enzyme, of a detectable colour change. It is also possible to 
employ fluorogenic substrates, which yield a fluorescent product. 

Alternatively, fluorescent compounds, such as fluorescein and rhodamine, may be 
20 chemically coupled to antibodies without altering their binding capacity. When activated 
by illumination with light of a particular wavelength, the fluorochrome-labelled antibody 
adsorbs the light energy, inducing a state of excitability in the molecule, followed by 
emission of the light at a characteristic colour visually detectable with a light 
microscope. As in the EIA, the fluorescent labelled antibody is allowed to bind to the 
25 first antibody-hapten complex. After washing off the unbound reagent, the remaining 
complex is then exposed to the light of the appropriate wavelength, the fluorescence 
observed indicates the presence of the hapten of interest Immunofluorescence and EIA 
techniques are both very well established in the art and are particularly preferred for the 
present method. However, other reporter molecules, such as radioisotope, 
30 chemiluminescent or bioluminescent molecules, may also be employed. It will be 
readily apparent to the skilled technician how to vary the procedure to suit the required 
purpose. It will also be apparent that the foregoing can be used to label a modified nef 
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product and to use same directly in the detection of, for example, circulatory antibodies 
specific to said modified nef product 



5 and/or LTR region. Such a genetic assay may be by Southern or Northern blot analysis, 
PGR analysis or the like using oligonucleotides specific to a deleted region of a nef gene 
and/or LTR region. 

According to this embodiment there is provided a method for determining the 
10 pathogenicity of an HTV-1 strain after said HTV-1 strain infects cells of an individual, 
said method comprising determining directly or indirectly the presence of a deletion 
mutation in the genome of said HIV-1 wherein the presence of a such a mutation is 
indicative of the presence of a non-pathogenic strain of HIV- 1. The deletion mutation 
may result in the genome being unable to synthesise a polypeptide or protein from a 
15 pathogenic strain of HTV-1 or may direct the synthesis of a truncated form of said 
polypeptide or protein* The mutation may also lead to altered expression of a 
polypeptide detectable by, for example, decreased synthesis of a particular protein, such 
as the nef gene product Alternatively, the deletion mutation affects the LTR region or 
a regulatory region of the HTV-1 genome. In either case, affected viruses may also be 
20 detected by, for example, observing low viral copy numbers such as low viral loads. 

Preferably, said non-pathogenic HTV-1 carries a deletion in its genome of at least 10 
nucleotides within the region from nucleotide 8787 to nucleotide 9709 using the 
nucleotide numbering of HTV-1 strain NL4-3. 



Preferably, said non-pathogenic HTV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 



Alternatively, genetic assays may be conducted to screen for abberations in the nef gene 



25 



nucleotide (i) 



8830-8862; 
9009-9035; 
9019-9029; and 



30 



(ii) 

W) 

Civ) 



9033-9049. 
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Preferably, said non-pathogenic HT\M carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 
5 (vii) 9105-9224; and 

(viu) 9271-9370. 

Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
10 nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 
(xn) 9112-9204. 

IS Preferably, said non-pathogenic HIV-1 carries a deletion in its genome of at least 10 
nucleotides from within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

20 

The above nucleotide numbers are based on the nucleotide numbering in the NL4-3 
genome* 

Particularly preferred oligonucleotides are based on the deleted regions of the nefgpnt 
25 and/or LTR region such as but not limited to one or more oligonucleotides based on 
SEQ ID NO: 2 to SEQ ID NO: 613 and/or SEQ ID NO: 652 to SEQ ID NO: 799. 

The present invention further extends to kits for the diagnosis of infection by pathogenic 
strains of HIV-1 or for determining the pathogenicity of infecting virus. The kits would 
30 be in compartmental form each comprising one or more suitable reagents for conducting 
the assay. 
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The present invention is further described by the foUowing non-limiting Figures and 
Examples. 

In the Figures: 

5 

Figure 1 is a repr e sen tation showing the alignment of the nucleotide sequences from 
donor D36 peripheral blood mononuclear cell (PBMC) [D36P] and non-pathogenic HIV- 
1 from recipient C18 fflV^y [C18S], C18 HIV^c [C18M] and C98 HIV [C98H] and 
C54 PBMC [C54P] with the equivalent region of HTV-1 NL4-3. Sequences labelled 

10 PBMC are from patient PBMC, those labelled HIV are from virus isolated from patient 
PBMC and grown in culture. Numbering for HTV-1 NL4-3 is as per Myer et al (1992) 
where nucleotide 1 is the first nucleotide of the complete proviral DNA sequence. 
D36P, C18S, C18M, C98H and C54P are numbered from the start of the region 
sequenced. Identity with NL4-3 sequence is shown by (*). Deleted nucleotides are 

IS shown by (-). Spaces introduced to maximise alignment are shown by (,). Features in 
NL4-3 are marked by overiining the sequence, features in D36 and C18 are marked by 
underlining the sequence. 

Figure 2 shows the alignment of encoded amino acid sequences of (a) tat exon 3 and 
20 (b) rev exon 3 from HIV-1 NL4-3, D36 PBMC, CI8 HTV^y and C98 HIV. In-phase 
termination codons (*) and NL4-3 encoded amino acid numbers are shown. 

Figure 3 is a representation showing the alignment of C-terminal envelope glycoprotein 
gp41 amino acid sequences encoded by D36 PMBC, C18 HTV^, C18 fflV^c and 
25 C98 HIV. Numbering is that of the amino acid sequence of the mature envelope 
glycoprotein of HIV NL4-3. Termination in shown by (#). 

Figure 4 is a representation showing alignment of amino acid sequences encoded by the 
we/genes of HIV-1 NL4-3, D36 PBMC, C18 fflV av> C18 HIV^ and C98 HIV. In 
30 phase termination codons are shown by (#). Identical amino adds are shown by (*). 
Residues underlined are those immediately before a deletion. 
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Figure 5 shows a duplication ofNRKB and Spl sequences in D36 PBMC, CI 8 HTV Stv> 
C18 HTV^qc and C98 HIV demonstrated by alignment of their low homology region 
sequences with the NFAB-Spl region of HTV-1 NL4-3. Nucleotide numbering 
according to Figure L Identity withNL4-3 sequence shown by (*) and NFJB and Spl 
sites in NL4-3 overlined. Position of neflLTK region sequence deletion shown by (a). 

Figure 6 is a graphical representation showing replication of CI 8 and C98 viral isolates 
and D36 PBMCs from asymptomatic patients in PHA-stimulated PBMCs. 




Figure 7 is a graphical representation showing replication of viral isolates from 
IS asymptomatic patients in non-PHA stimulated PBMCs. 



20 




228200 is an Australian isolate of HIV-1 described by Kieman, R- et al (1990). Its 
characteristics include being T cell trophic, with fast kinetics, high producer of HIV-1 
and/or SI phototype. 



25 Figure 8 is a graphical representation of cell surface receptor expression for syncytiar 
indu ing (Siy non-syncytia-inducing (NSI)/asymptomatic patient isolates. 



30 



■ 


TfR 




IL-2R 


■ 


CD4 
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228200 is defined in the legend to Figure 7. 243925 is a viral isolate of HTV-1 which 
is monocyte/macrophage trophic and exhibits NSI phenotype (Dr Karen Coats-Fryer, 
PhD thesis entitled "Viral determinants of HIV- 1 syncytium formation", the University 
5 of Melbourne, ParkviUe, Victoria, Australia. 

Figure 9 is a representation of the nucleotide sequence of C18 HTV-1 MBC (SEQ ID NO: 
800). 

1 0 Figure 10(a) -(g) are graphical representations showing clinical immunology of cohort; 
(a) CD3; (bXi) CD4 (ii) CD4%; (c)(i) CDS; (ii) CD8%; (d) lymphocyte count; (e) 
CD4/CD8 ratio; (f) p-2-microglobulin; and (g) Kaplan-Meier estimates of time to 
disease progression (Cox & Oakes, 1989). 

IS Figure 11 is a schematic representation of the deletion mutants of the present invention. 
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A summary of the SEQ ID Nos. used in the subject specification is shown below: 



5 


SEO ID NO: 


DESCRIPTION 






Nucleotide seauence of HTV-1 strain NL4-3 genome 




2-613 


Decamicleotidfcs of fic/" eerie from HTV-1 strain NL4-3 


10 


614 


Partial nucleotide seauence of D36 HTV-1 isolate 




615 


Partial nucleotide seauence of C18 HTV-1 xro^ isolate 




616-625 


PCR nrrrners shnwn in Table 1 




QZO-OJ J 


QAfflfMiM >vri i ii ore; eKAwn in TTcih1#* 0 

ocqucucc pnnicxs snuvvn ixl i uujv ^ 




634 


Amino acta residues i j-z/ 01 xii v-i ini^to /«?/ protein 


15 


635 


rilv-1 Jar exoas (rigurc Z) 




OJO 


niV'i li jo far exons ^rigurc l) 




03 / 


tlTV.i pi 0 avAMfl /T?5mm» 9^ 

ni y-i v*io icu exons ^rigurc *>} 




Ojo 


IJTV-1 MT AJX r*v wnn« fFimiffc 2^ 

XXI V — 1 iXly*T— J rsr CAvlUk \X IgUlw 




t>jy 


UT\7 1 TYIA mu avrnic /Pimm* *)\ 

ill Y-i UjO rev exons ^rigurc zj 


20 


640 


rii v-i cio rev exons \rigure zj 




641 


H1V-1 NL4-3 c-tennmai or gp*n irigure 3j 




642 


HTV-1 D3o oterminai 01 gp4i (rigure j) 




643 


tjt\7 f f^f o /"i |_ LF .. 1 ; ria | /XTifrrm* I'X 

HI V-I Uio OmiliTnfll OI gp*** v" 1 8 un5 J ^ 




644 


rtl v-l fiL*r-J nc/ gene ^rigure h) 


25 


645 


HTV-1 D36 nef gent (Figure 4J 




646 


HTV-1 C18 ne/gene figure 4) 




647 


HIV-1 NL4-3 NFKB/SP1 sequence (Figure 5) 




648 


HTV-1 D36 NFKB/SP1 sequence (Figure 5) 




649 


HIV-1 C18 NFKB/SP1 sequence (Figure 5) 


30 


650 


Nucleotide sequence of nef gene from HTV-1 strain NL4-3 




651 


Nucleotide sequence of em and nef regions of NL4-3 




652-799 


Decanucleotides of LTR region from HTV-1 strain NL4-3 




800 


Nucleotide sequence of CI 8 Hry-l^gc 
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EXAMPLE 1 

Source Material 

For the purposes of the following examples, a non-pathogenic HTV-1 strain was isolated 
5 from a recipient of HTV-1 infected blood. The recipient is designated "08". Other 
recipients are defined as "G54" and "098". The donor is identified herein as n D36 n . 
The place of isolation may be indicated after the abbreviation of "HIV". For example, 
St Vincents Hospital, Sydney (HTV^y) or Macfarlane Burnet Centre of Medical 
Research, Melbourne (HTVj^^. 

10 

Exemplary viral isolates referred to herein as *C18 n and R C98" were deposited at the 
PHLS Centre for Applied Microbiology and Research, European Collection of Animal 
Cell Cultures (ECACQ, Division of Biologies, Potion Down, Salisbury, Wiltshire SP4 
OJG. CI 8 was deposited on 17 October, 1994 under Provisional Accession Number 
IS V94101706 and C98 was deposited on 31 October, 1994 under Provisional Accession 
Number V941031 169. 

Figure 11 is a summary of the deletion mutants of the present invention. 

20 Viruses were isolated by the following procedures: 

L Infected peripheral blood mononuclear cells (PBMCs) were co-cultured with 
HTV-1 seronegative donor PBMCs. A convenient source of seronegaitve donor 
PBMCs is a blood bank The supematants and cells are harvested every 7 days 
and fresh medium added with CD8 depleted PBMCs. CD8 depletion promotes 

25 the ability to isolate HTV-1. The culture and procedure is continued for up to 

approximately 5 weeks; 

2. The infected PBMCs are purified from whole blood and these cells are cultured 
alone for up to 5 weeks. PMBCs alone arc used because the virus is more likely 
30 to be monocytotropic Fresh medium is added weekly and supernatant fluid is 

harvested at this time; 
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Supernatant fluids are harvested every approximately 7 days, fresh medium and 
fresh HIV-1 seronegative CDS depleted PBMCs are added at this time; 

HIV-I seronegative PBMCs are preheated with with M-CSF for approximately 
72 hours prior to the addition of infected PBMCs. M-CSF has been shown to 
enhance HIV-1 replication in monocytes (Gendelman et al, 1988); or 

The supernatant fluid is harvested from the cultures of step 4 every 
approximately 7 days, fresh medium added together with HIV-1 seronegative 
stimulated CD8 depleted PBMCs. The virus is isolated from the infected 
PBMCs. 

EXAMPLE 2 

DNA Preparation and PCX Amplification 
15 Non-pathogenic HTV-1 (e.g. strain CI 8) infected peripheral blood mononuclear cells 
(PBMQ were harvested 4 days after infection of phytohaemagglutinin (PHA) stimulated 
HIV-1 negative donor PBMC cultured by the method of Neate et al (1987) and washed 
in phosphate buffered saline (PBS). PBMC from Donor D36 and Recipients CI 8, CS4 
and C98 were prepared by Ficoll isopaque centrifugation of bufly coat cells and washed 
20 with PBS. 

Approximately 10 7 cells were lysed in 1ml lysis solution (0.45% v/v NP40, 0.45% v/v 
Tween 20, lOmM Tris-HCl pH 8,3, 40mM KCi 2.5 mM MgCl^ and digested with 
60fig/ml proteinase K (Boehringer Mannheim) at 55°C for 1 hour followed by 100°C 
25 for 10 minutes. Lysates were stored at -20°C. 

All polymerase chain reaction (PCR) primers (Table 1) and sequencing primers (Table 
2) were synthesised using an Applied Biosystems model 391 DNA synthesiser using 
phosphoramidite chemistry. 

30 

Strict physical separation was maintained for sample, PCR reagent mix and PCR reaction 
preparations as well as amplification and analysis- Final reaction mixes (50|il) contained 



4. 

5 

5. 

10 
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2 nl neat or diluted cell lysate, 0,2jiM each primer, 200mM dNTPs and 1*25 units Taq 
polymerase (Boehringer Mannheim) in PCR buffer, (lOmMTris-HCl pH8.3, 50mM KC1, 
lOOjog/ml gelatine) adjusted to the optimum MgCl 2 concentration for the primer pair 
(L5-3.0mM). Aliquoted reagent mix was overlaid with 50^x1 mineral oil prior to 
5 addition of DNA template lysate. After template denaturation at 94°C for 3 min 
amplification was achieved with 30 cycles of 94°C, 1 min; 55°C, 1 min; 72°C, 2 nrins. 
A final elongation reaction was conducted at 72°C for 7 minute. For double PCR 
amplification 2 jil of first round product was added to the second reagent mix and 
amplified as before. 

10 

PCR amplified DNA was checked for quality, quantity and fragment size by agarose gel 
electrophoresis in Tris-Acetate-EDTA buffer (Sambrook et al, 1989) stained in ethidhnn 
bromide and viewed by UV transillumination. 

IS EXAMPLE 3 

DNA Sequence Analysis 
The DNA sequence of PCR amplified HTV-1 regions was determined by the 
dideoxynucleotide method (Sanger et al, 1997) using Sequenase T7 polymerase (United 
States Biochemicals). 

20 

PCR amplified DNA was purified by PCR Magic prep resin chromatography (Promega). 
Approximately 2 to 7 fig purified DNA plus lOng specific primer (Table 2) were 
denatured by boiling for 3 ™™ and snap frozen to -20°C. The initial labelling reaction 
was for 3 minutes at 22°C (room temperature) with 35 SdNTP (SOOCi/mmol; Dupont) 

25 followed by dideoxynucleotide termination reactions at 37°C for 5 minute. NP40, to 
0.45% v/v, was included in denaturation and reaction mixes (Bachman et al, 1990). 
Sequencing reaction products were denatured in fbnnamide and resolved cm a 6% w/v 
polyacrylamide gel containing 8M urea, fixed in 1 0% v/v acetic acid, 1 0% v/v methanol 
and dried Following autoradiography on XK1 film (Kodak) the gel sequences were 

30 read assembled, translated to protein and aligned using the PC/GENE suite of programs 
(IntelliGenetics, USA). 
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TABLE 1 
PCR PRIMERS 



1 PRIMER 


SEQUENCE 1, 3 


POSITION 2 


CM 


TGGAAGGGCTAATTTGGT(6 1 6) 


1-18 


Cl-2 


ATCTTCCCTAAAAAATTAGCCTGTC(6 1 7) 


2099-2075 


LTR-3' 


AGGCTCAGATCTGGTCTAAC(6 1 8) 


9559-9540 


SK68 


AGCAGCAGG AAGCACTATGG(6 1 9) 


7786-7805 




TGCrAGAGATTTTCCACAC(620) 


9709-9691 


KS-2 


AGTGAATAGAGTTAGGCAGG(62 1 ) 


8326-8345 


RT5'-v3 


GTAAGACAGTATGATCAGATA(622) 


2418-2438 


RT3'-v2 


TTGTAGGGAATTCCAAATTCC(623) 


4660-4640 


RT5'-v2 


CAGGATCCTACACCTGTCAACATAAT(624) 


2487-2506 


RT3'-vl 


GGGAATTCCTTATTCCTGCTTG(625) 


4655-4634 



1. Sequence is presented from 5' to 3' of the primer. 

2. Position is according to the numbering of HIV NL4-3 in Myers et al (1992). 

3. SEQ ID NOs are given in parentheses. 
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TABLE 2 
SEQUENCING PRIMERS 



PRIMER 


SEQUENCE 1, 3 


posmoN 2 


KS3 


CCAGAAGTTCCACAATCC(626) 


8570-8553 


KS4 


TTCTTCTAGGTATGTGGAG(627) 


8753-8735 


KS5 


AGTQAATTAGCCCTTCCAG(628) 


9093-9075 


KS6 


TGCTAGAGATTTTCCACAC(629) 


9709-9691 


SP2 


TGCTCTGGAAAACTCAT(630) 


8006-8022 


SP3 


CTTTCTATAGTGAATAGAG(63I) 


8318-8336 


SP4 


TATTGGAGTCAGGAACT(632) 


8618-8634 


SPR. 


GGTCTAACCAGAGAGAC(633) ■ 


9547-9531 



15 1. Sequence is presented from 5' to 3' of the primer. 

2. Position is according to the numbering of HTV NL4-3 in Myers et a! (1992). 

3. SEQ ID NOs are given in parentheses. 

EXAMPLE 4 

20 Cells and Cell Culture 

Peripheral blood was obtained from HIV-1 sero-negative volunteers and mononuclear 
cells prepared by centrifugation on a Ficoll/Hypaque density gradient (Peper et al, 1968). 
PBMC were activated with phytohcmagghitmin (PHA; 1 Ofig/ 1 0 6 cells) for 4 8 h at 3 7°C 
washed and then cultured in RPMI 1640 medium containing 10% v/v heat inactivated 

25 foetal calf serum, 15mM HEPES, 0.1% v/v sodium bicarbonate, 25pg/ml polybrene 
(Sigma), 10% v/v interleukin 2 (Boehringer Mannheim) and 1:1000 anti-urterferon 
(Miles) (IL-2 medium). Non-FHA stimulated cells were prepared in a similar manner 
except.they were cultured in medium lacking PHA and IL-2. 
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EXAMPLES 

Antipeptide-aniisera 

Antibodies specific for HTV-1 Nef were raised against a peptide corresponding to the 
predicted amino acid residues 15-27 (AVRERMRRAEPAA SEQ ID NO: 634) of Nef 
5 encoded by the HTV-1 clone pNL4.3 (Kemp et al, 1988). The peptide was conjugated 
to keyhole limpet hemocyanin (KLH; Calbiochem, Behring Diagnostics, CA) via 
glutaraldehyde and this complex used to immunise sheep (O.Smg peptide 
conjugate/sheep). Antibodies to the peptide were purified by affinity chromatography. 
Reactivity of the antibodies with recombinant HIV-1 Nef 25 and 27 was demonstrated 
10 immunoblotting. 

EXAMPLE 6 

Reactivity ofanti-Nef ^ 2 7) *** 1117 C18-infected Cells in Immunoblotting 
Seven days post-infection HTV-1 C18-infected PBMCs and mock - infected cells were 

15 washed in PBS then lysed (0.5% w/v NP-40, 0.5% w/v sodium deoxycholate, 50mM 
Nad, 25mM Tris-HQ, lOmM EDTA, 0.01% w/v sodium azide and lOmM 
pihenylmethylsulphonylfluoride). After nuclei were spun out lysates were 
electzophoiesed in a 13% w/v SDS-polyacrylamide gel (SDS-PAGE) and subsequently 
transferred to Hybond-C nitrocellulose (Amersham, Buckinghamshire, England) for 1 

20 h at 100 V using a Bio-Rad protein transfer cell (Bio-Rad, Richmond, Ca). Membranes 
were pre-incubated with 1% w/v BSA/PBS for 2 h at rocnn temperature and then reacted 
with affinity purified sheep anti-Nef^^, diluted 1 : 1 00, overnight at room temperature. 
After three washes in 1% w/v BSA/PBS, the blots were incubated with donkey anti- 
sheep Ig conjugated to biotin (Amersham, diluted 1:500) for 1 h at room temperature. 

25 After extensive washing as described above the membranes were incubated with 
streptavi din-conjugated horse radish peroxidase (Amersham; diluted 1:500 for 1 h at 
room temperature.' All dilutions were made with 1% w/v BSA in PBS, After further 
washing the membrane was developed with phenylenediamine substrate (Dako, 
Dapopatts, Denmark). The antibody preparation used in the immunoblotting experiments 

30 was free of detectable antibodies to the immunogenic carrier protein and coupling 
reagent 
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EXAMPLE 7 

Analysis by Polymerase Chain Reaction Amplification 
A 5' fragment defined by primers CM and Cl-2 containing the 5' LTR and part of the 
gag gene was amplified. DNA from HTV-1 C18 infected PBMC gave an amplified 
5 fragment (amplimer) of about 1 .9 kb compared with 2. lkb for pHXB2 control template, 
implying a deletion of about 200bp from HTV C18. Further amplification of this 
fragment with primers defining the U3 region of the LTR (CH and LTR-3*) gave 
amplimezs of about 300 bp for HTV-1 C18 infected PBMC DNA compared with 340bp 
for CI 8 and D36 PBMC DNA and 484 bp for pHXB2 control. This implies the loss of 
10 approximately 140 to 180bp from the U3 region of these proviral DNAs. 

To analyse the /i*/-gcne-3' LTR region, the nested primer pairs SK68-C1-6 and KS-2- 
LTR-3' were used in a double PCR. Amplimers of approximately 830bp were obtained 
for HTV-1 C18 infected PBMC DNA as well as for PBMC DNA from Donor D36 and 
1 5 Recipients CI 8, C54 and C98 compared with approximately 1230bp for pHXB2 DNA. 
These results suggest that about 400bp of DNA have been deleted from the Donor and 
Recipient proviral DNAs. 

In comparison, amplification of the polymerase gene region by double PCR with the 
20 nested primer pairs RT5'-v3-RT3'-v2 and RT5'-v2-RT3'-vl gave a fragment 
(approximately 2.1 kb) the same size as the molecular clone pHXB2 fragment for HIV-1 
C18 infected PBMC DNA, suggesting that deletions from tins region were unlikely. 

EXAMPLE 8 

25 Nucleotide Sequence of the nef>3 'LTR Region 

PCR amplification experiments indicated an approximately 200bp nucleotide deletion 
from both the nefgpne and LTR regions of Donor D36 PBMC and Recipient CI 8 HTV-1 
proviral DNA. To further analyse these regions, the DNA sequence was determined for 
the PCR amplified nef-3 -LTR region of D36 PBMC, CI 8 isolates HTV^^ and HIV SfV 

30 as well as isolate C98 HTV infected PBMC proviral DNA- The 3' region was amplified 
with outer primers (SK68-C16) and inner primers (SK68-LTR 3' or KS2-C16) and 
sequenced directly using a number of internal sequencing primers based on the HTV-1 
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NL4-3 nucleotide sequence (Table 2). 

Alignment of the nucleotide sequences of the amplified 3' region of donor D36 PBMC 
and recipient CI 8 isolates HTV MBC and HTV StV and C9S HIV (Fig 1) showed a number 
S of nucleotide sequences changes, including deletions, relative to the nucleotide sequence 
of wild-type infectious HIV-1 (HIV-1 NL4-3). In the region of alignment, D36 PBMC 
lacked 291 nucleotides, CI 8 HTV stv differed in size by 388 nucleotides (comprising 
deletions of 397 nucleotides and an insertion of 9 nucleotides), CI 8 HTVj^ differed 
by 456 nucleotides and C98 HIV lacked 1S8 nucleotides compared with HIV-1 NL4-3. 
10 The overall identity with HIV-1 NL4-3 nucleotide sequence of D36 PBMC, CI8 
HIV StV , HIV]^ and C98 HIV nucleotide sequences, including deletions, was 73% 
(1157/1596), 67% (1459/1592), 62% (982/1592) and 79% (1105/1399), respectively. 

The D36 PBMC sequence differed from HIV-1 NL4-3 in a number of features. A 
15 change in the wild type tat termination codon from TAO to TCG (Ser) extended the 
third tat exon (which starts at splice acceptor 10) by a further 15 amino acids to 
terminate at a conserved TAG (Fig 1). The resulting C-terminal peptide is rich in 
charged amino acids (8/15) (Fig 2a). The wild type rev termination codon has also 
changed (TAG to GAG, Glu) and the third rev exon is extended for 14 codons to 
20 terminate at a conserved TAG (Fig 2b). The encoded extra amino acids are mainly 
polar (11/14) and charged in nature (Fig 2b). The sequence also encodes the <>terminai 
237 amino acids of Env gp41 (Fig 3) terminating at the normal termination codon. The 
D36 PBMC Env amino acid sequence has 85% identity with the HTV NL4-3 sequence, 
increasing to 89% if similarities are included. 

25 

There are significant differences fiom HIV-1 NL4-3 downstream of the env (gp41) gene. 
A change in the fifth ne/ codon, fiom TGG (Tip) to TGA (Fig 1), introduces an early 
termination in the D36 PBMC nef gene. The encoded Kef protein is identical to the N- 
terminal 4 amino acids of NL4-3 Nef (Fig 4). Following the early termination there are 
30 deletions of 33, 47, 93 and 91 nucleotides and a region of low sequence homology, 
compared with HTV NL4-3, prior to the wild type ^/termination codon site (HTV NL4- 
3 nts 9405-9407). As well as removing a significant part of the nef gene, these deletions 
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also bring into phase a further 6 termination codons. While the polypurine tract (plus 
strand primer binding she) and the first 38 nucleotides of the LTR U3 region are 
perfectly conserved, downstream U3 region sequences are disrupted by the 93 and 91 
nucleotide deletions and the low homology region. The resulting U3 region lacks 
5 recognition sequences for die transcription factors c-myb t USF and TCFla as well as 
one of the suggested NF-AT sites (Gaynor et al, 1992). Downstream torn the 91 
nucleotide deletion, a 59 nucleotide region of low homology contains two extra NFKB 
enhancer sites 19 nucleotides upstream of the usual site of a pair of NFKB sites, the 
upstream one of which is altered in its 5'-half in D36. Sequences further downstream 
10 are highly conserved with respect to HIV-1 NL4-3, including the position and number 
of Spl basal promoter sites, TATA box, TAR and polyadenylation signal sequences. 

Similar to D36 PBMC, the C18 HTV StV andHIV]^ sequences show the tat third exon 
to be extended by 15 codons. All but two codons (altered by point mutations) are 
15 identical to those of D36 PBMC (Fig 2a). The rev third exon of both CI 8 isolates is 
also extended (Fig 2b) but by only three codons, identical to the first three codons of 
the D36 PBMC rev extension. The same 237 amino acid coding region of Env gp41 is 
found in both the CI 8 HIV DN A sequences (Fig 3) and shows 85% identify, increasing 
to 88% if similarities are included, with the same region of the NL4-3 Env gp41. 

20 

It is in the nef gene and LTR regions that the major differences from wild-type HIV-1 
arise, just as in D36 PBMC The nef gene of C18 HTV^y encodes 24 amino acids with 
9 of the 10 N-temrinal being identical to the NL4-3 Nef protein (Fig 4). Thereafter, 
deletions of 177 and 1 1 nucleotides cause a frameshiftattdtenninati(matthe25thcodon 
25 (Fig 1). Downstream deletions of 120, 82 and 7 nucleotides cause further loss of wild 
type nef gent sequence and bring into phase a further three termination codons. 

The nef gene of C18 HTV^^ encodes only 7 amino adds with only the initiator 
methionine identical to the NL4-3 Nef protein. This loss of identity and early 
30 termination is bought about by a 250 nucleotide deletion after Ac fifth nucleotide of 
the nef gene. Downstream deletions of 120 and 86 nucleotides cause further loss of 
wild-type nef gene sequences. In both CIS isolates there is perfect conservation of the 
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polypurine tract and 29/31 nucleotides at the 5' end of the U3 region immediately before 
the 120 nucleotide deletion (Fig 1). This deletion together with the downstream 82 and 
7 nucleotide deletions in HTV StV and 86 nucleotide deletion in HTV MBC and the low 
homology region cause the loss of the 5' half of the NRT-1 site (Yamamoto et al 1992) 
5 and die downstream NFAT site. A third NFKB site is present 31 (HTV ttV ) and 33 
(^^MBc) nucleotides upstream of the expected pair of NFKB sites which are 
themselves separated by 13 nucleotides instead of the 4 nucleotides in HIV-1 NL4-3. 
The S'-most Spl site sequence is slightly altered but sequences downstream including 
the other 2 Spl sites, the TATA box, TAR and potyadenylation signal sequences are 
10 identical to HIV-1 NL4-3 sequence. 

The three sequences, D36 PBMC, C18 HTV^y and C18 HTV^q show a number of 
similarities consistent with the transmission of vims from person D36 to person C18 as 
well as a number of differences indicating post-transmission divergence of sequence. 

IS All three have tat open reading frames (ORFs) extended by 15 codons. All three have 
extended rev ORFs. The new rev termination codon in both C18 HIV-1 isolates, three 
codons downstream of die NL4-3 rev termination codon, has a point mutation in D36 
PBMC to make a Ghi codon so that it continues for a further 11 codons (Fig 2b) to 
terminate at a conserved TGA. The partial Env gp41 amino acid sequences are more 

20 closely related to each other (86% identity or 90% including similarities) than to HIV 
NL4-3 (85% and 89%, respectively). 

The nucleotide sequence of the nef and LTR region of the HIV-1 isolate from recipient 
C98 (C98 HIV) is 90.3% identical (1264/1399) to the HIV NL4-3 sequence, ignoring 

25 deletions. Similar to the D36 PBMC and C18 HTV StV and HIV MBC isolates the C98 
HIV sequence shows the third exon of tat to be extended by 15 codons with all but one 
being identical to the D36 PBMC tat extension. Also, the rev gene is extended by 3 
codons, 2 of which are identical to the first 2 codons of the D36 PBMC rev extension. 
The sequence also encodes the C-terminal 223 amino acids of Env gp41 terminating at 

30 the normal termination codon. The C98 HTV Env amino acid sequence has 89% identity 
with HTV NL4-3 Env sequence, increasing to 92% of similarities are included. 
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As wife the D36 PBMC and the C98 HTV isolate sequences it is the nef gene and LTR 
regions that major differences from the HTV NL4-3 sequence arise. The nef gene open 
reading frame of C98 HIV is much longer than in D36 PBMC, C18 HlV^y and 
HWmbo encoding 85 amino adds compared with 206 amino acids for NL4-3. Sixty 
5 eight of those 85 amino acids are identical to the N-terminal sequence of NL4-3 Nef. 
The single, small deletion (16 nucleotides) in the C98 HIV n^alone regions (Table 3) 
occurs after nefcodon 82 causing a frameshift and termination after a further 3 codons 
at the start of the highly conserved polypurine tract sequence immediately before the 3'- 
LTR The nefLTR region has two deletions totalling 142 nucleotides. The 5'-most 

10 deletion of 42 nucleotides includes the splice acceptor 12 sequence. The NRT-1, dyad 
symmetiy and myb response element sequences are all intact However, the downstream 
100 nucleotide deletion includes sequences from the 3' end of the 5VNF-AT and all of 
the y NF-AT sequences as well as the USF transcription factor recognition sequence. 
The downstream low homology region of 77 nucleotides lacks the TCF-lot sequence but 

15 has two additional NFKB sites 13 nucleotides apart and 26 nucleotides upstream of the 
3'-half-remmant of the normal 5 -NFKB site. Sequence downstream, including the 3'- 
NFKB site, the 3 Spl sites, TATA box TAR and polyadenylatkm signal sequences are 
all highly conserved. 

20 The main feature of the sequences is the series of deletions, with respect to HIV NL4-3, 
in the nef gene-3'-LTR region. These can be grouped into two regions namely the nef 
alone region, that part of the nef gene upstream of the LTR, and the ne/TLTR region, 
where the nef gene and LTR U3 regions overlap. The deletions in these regions of each 
of the sequences start and end at the same or similar positions (Table 3). The deletions 

25 are larger in C18 HTV^y and C18 HIV MBC sequences where totals" of 397 and 456 
nucleotides have been deleted (relative to NL4-3) compared to 291 nucleotides, from 
D36 and 158 nucleotides from C98 HIV. In the iz^-alone region the two deletions in 
C 1 8 HTVgfy and the single deletion in CI 8 HTV^^ occupy fee same region as fee three 
deletions in D36 PBMC. Similarly, the w/TLTR region in the three deletions in the C18 

30 HTVggv, fee two deletions in fee C18 HIV MBC and fee D36 PBMC sequences occupy 
fee same region. These findings indicate that mutant virus was transmitted from D36 
to C18 after which further deletions and rearrangements occurred. Similarly, fee 
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sequence of C98 HIV in the neJILTR region indicates two deletions occupying the same 
region as the neflLTR deletions in D36 and the CI 8 sequences. However, the size (only 
16 nucleotides) and the position of the deletion in the nef-siow region of C98 HIV are 
distinct from those of the D36 PBMC and C18 sequences. 

5 

The tuning of transmission of virus by transfusion was that recipient C18 was transfused 
approximately 19 months after C98. Consistent with the relative timing of transmission 
and the sequence similarities and differences is the suggestion that at the time of 
transmission to C98, the D36 sequence had deletions in the neflLTR region but not in 

1 0 n^alone region. After transmission to C98, the C98 virus developed further deletions 
and rearrangements, including the deletion in the n^alone region. The D36 virus 
evolved so that at the time of transmission to C18, further deletions and rearrangements 
had occurred including deletion of sequence from the nefalone region distinct from the 
C98 HIV w^alone region deletion. After transmission to CI 8, further deletions and 

15 rearrangements occurred in the C18 virus giving rise to at least two sequences (HIV stv 
and HTV^c). 

The nef-alons deletion region may be a mutation or recombination "hotspot" as it 
includes sequences that were found to be variably duplicated in 28 out of 54 Nef protein 

20 sequences derived from 8 of 12 patients analysed in a study (Shugars etal 1993). The 
sequence between the n^alone and the neflLTR region deletions is highly conserved 
and is important in pro vims integration into the infected cell genome and interacts with 
a number of cellular proteins. It is interesting that the sequence equivalent to NL4-3 
nucleotides 9209 to 9225 is retained in D36 and C98 HIV but lost in the C18 HTV 

25 sequences. This includes part of a sequence of dyad symmetry (9210 to 9231) and is 
a significant part of the binding site for NRT-1 (Yamamoto et al 1991) which has been 
shown to have a negative regulatory effect on HIV- 1 expression. The presence of this 
sequence in D36 and C98 HIV and its absence from the CI 8 isolates may correlate with 
the inability to isolate virus from D36 PBMC and the poor replication of C98 HTV but 

30 the ability to isolate HTV-1 from CI 8 PBMC. The deletion of sequence equivalent to 
nucleotides 9281 to 9395 of NL4-3 causes the loss of some transcription factor binding 
sites including NFAT and USF from the D36, C18 HTV and C98 HTV sequences. 
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A further similarity between the D36, C18 HIV SlV , C18 HTVj^ and C98 HIV 
sequences is a region of low homology to HTV-1 NL4-3 extending downstream of the 
nefUYR deleted region to the NFKB enhancer/Spl promoter she region. This low 
homology region in feet consists of incomplete duplications of part of the NFKB/Spl 
5 region (Fig 5) resulting in D36 and C98 HIV having 2 extra NFKB sites upstream of 
an altered 5' NFKB site while the C18 sequences have one extra NFKB site and altered 
spacing between the 5' and 3 f wild type NFKB sites due to an insertion of 9 nucleotides. 

For the CI 8 and C98 HIV- 1 isolates virus replication was assessed in PHA-stimulated 
10 and non-stimulated PBMCs (Figs 6 and 7). In PHA-stimulated PBMCs we also studied 
cell surface CD4 and IL-2R expression (Fig 8). In comparison with HTV-1 wild type 
SI and NSI isolates clearly both C18 HTVj^ and C98 viruses are replication 
competent, though C98 HIV replicates more poorly than C18 HTV^c and are of the 
NSI phenotype when syncytium formation and CD4 and IL-2R surface expression are 
IS taken into account Additionally, and more surprisingly, these two viruses replicated 
almost as efficiently in non-PHA stimulated PBMCs when compared to a typical local 
wild type SI isolate (HIV-1 228200, Fig 7). 

When protein expression was assessed for C18 HTV^c and C98 HIVi^^ although 
20 structural proteins were identified, no typical Nef protein was seen in infected cells. 
However, analysis of cell lysates prepared from PBMC infected with C18 HTV^^ or 
PBMC infected with C98 fflVj^ (which were subsequeody stimulated by UV 
irradiation, see Valerie et al 1988) by Western immunoblotting using two antibodies 
specific for the N-terminal region of Nef showed the presence of smaller proteins of 1 9 
25 kDa and 21 kDa, respectively. These proteins were not observed in mode-injected 
control PBMC lysates and were not observed when the iniected-cell lysates were probed 
were probed with antibodies reactive with the C-tenninus of Nef. 

Thus/although the C18 and C98 HIV isolates are replication competent in vitro they 
30 clearly replicate differently using different conditions for cell activation and from the 
known functions of HTV-1 Nef protein and die LTR show that the major deletion in the 
nef gene and/or the LTR is at least in part responsible for the outcome of infection, 
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implicating the importance of Nef and/or the LTR in the clinical outcome of infection 
in vivo. 

EXAMPLE 9 

5 Determination of Degree of Relatedness Between Viruses 

To determine the degree of relatedness between viruses such as between mutants or 
between mutants and a wild-type virus and to ascertain putative infected patients, the 
method of Delwart et al was employed* 

10 EXAMPLE 10 

Immune responsiveness of subjects infected by non-pathogenic HIV-1 isolate 
In this example, the donor and recipients of the cohort were tissue typed and assessed 
for fra sfc cellular nimipng responses. Proliferative responses and EL-2 production to the 
mitogens ConA and FHA, to allogeneic mononuclear cells (irradiated pooled 

IS mononuclear cells from 20 random donors) and to recall antigens (e.g. influenza and 
tetanus toxoid) were within normal ranges. While at the immunogenetic level, HLA 
typing failed to identify a consistently common allele or haplotype within the group. 

The conservation of CD4+ counts observed in the cohort, the relative integrity of their 
20 immune system and the varied HLA types of the donor and recipients further supports 
the fact that the symptomless condition of the cohort members is due to a non- 
pathogenic drain of HTV-1 or a strain of low virulence. 

Accordingly, this provides a screening procedure for subjects putatively infected by a 
25 non-pathogenic HTV-1 isolate where such subjects are seropositive for HTV-1 (eg. have 
antibodies to an HIV-1 glycoprotein) yet have normal proliferative responses and 
cytokine production to mitogens, allogeneic mononuclear cells and to recall antigens* 
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EXAMPLE 11 

Clinical Immunology of Cohort 
To establish that the donor and the recipients belonging to the cohort exhibit normal 
5 immunological profiles, members of the cohort were assayed for GD3, CD4, CD8, 
lymphocyte count, CD4/CD8 ratio and (^-microglobulin over time since 
seroconversion. 

Parameters considered normal in non-infected individuals range as follows: 

10 ; 









CD3 


55-82% 


620-2200 (xl0 6 /L) 


CD4 


29-58% 


420-1410 (xltf/L) 


CD8 


12-43% 


200-980 (xlO 6 /!,) 


Lymphocyte count 


1000-3500 (xl0 6 /L) 




CD4/CD8 


0.7-3.7 




jJ-2-microglobulin 


0.00-220 mg/L 





20 The results are shown in Figures 10(aHg) and clearly show that die immunological 
profiles of cohort members are substantially normal further highlighting the non- 
pathogenic^ of die HIV-1 isolates of the present invention. Figure 10(g) shows a graph 
of the Kaplan-Meier (Ox and Oates, 1989) estimates of time to disease progression 
(AIDS or GD4 > 250). The results demonstrate thai fee difference is large in spite of 

25 the conservative bias, with a median time to progression of 6.2 years in the main 
database. An exact logrank test (Cytd Software Corporation, 1989, StatXact Statistical 
Software for Exact Nonparametric Inference. Cambridge, Massachusetts.) was 
performed, demonstrating that the difference between the groups was highly statistically 
significant (logrank statistic 1 L8, pO.0001). 

30 
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EXAMPLE 12 
Sequencing of isolate HIV-1 CI^mbc 
The genome of variant HIV-1 designated C18 HTVM^c was amplified by the 
5 polymerase chain reaction (PGR) as 7 overlapping fragments using die sets of inner and 
outer oligonucleotide primers, designed using the programme PGRPLAN 
(IntelliGenetics), listed in Table 5 and either UlTma (Applied Biosystems) or a mixture 
of KienTaq and Pfu (KienTaq LA, Ab Peptides Inc) polymerases (for faithful 
amplification of long fragments). The resulting fragments were cloned into the Smal 

10 site of the plasmid vector pGEM7Zf+. Insert-containing clones representing each region 
of the full length variant HIV-1 were sequenced by a nested deletion strategy (Gou & 
Wu, 1982) and cycle sequencing with Taq polymerase and dye labelled primers 
complimentary to the T7 or SP6 sites within the cloning vector. Nucleotide sequences 
were altered and collated by ASSEMGEL and SEQIN (IntelliGenetics) and SEQED 

15 (Applied Biosystems) and translated to the encoded amino acid sequences using 
TRANSL (IntelliGenetics) programmes. Sequence alignments used N ALIGN, 
CLUSTAL (IntelliGenetics) and SEQED programmes. 

The full length sequence (Fig 9; SEQ ID NCV.800) of isolate HIV-1 018^ is 9207 
20 nucleotides long which is 506 nucleotides shorter than the HIV NL4-3 sequence. This 
size difference is comprised of 126 nucleotides of insertions and 632 nucleotides of 
deletions, see Table 6. The most extensive differences between the HIV-1 ClSj^ 
sequence and HIV-1 NL4-3 are in the U3 region of the LTR and in the nef gene, as 
hereinafter described 

25 

The 5' LTR has deletions of 120 and 87 nucleotides and a region of low sequence 
homology, which is the result of an imperfect duplication of the downstream NFkB and 
Spl response sequences. These result in the loss of sequence from a number sites 
important in the regulation of transcription of HIV-1 genes, including the negative 
30 response element (NRE) and the response elements for a number of transcription factors 
including NF-AT, NRT-1, USF and TCF-Icl Furthermore, the low homology region 
contains an extra NFkB and Spl sites as well as an insertion of 9 nucleotides between 
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the usual NFkB sites. Downstream of the NFkB sites the sequence of the LTR has a 
high level of homology (96.2%) with the same region of HTV NL4-3. 

The gag gene contains 3 insertions, which represent direct repetitions of adjacent 
5 sequences. The first is a perfect repeat of 15 nucleotides after the equivalent of 
nucleotide 1 134 of HTV-1 NL4-3 and adds 5 amino acids to the C-terminus region of 
pi 7^. The remaining 2 insertions are imperfect and perfect repeats of 30 and 6 
nucleotides, respectively, after the equivalent of HIV NL4-3 nucleotides 2163 and 2232, 
respectively. These encode an extra 12 amino acids in the C-terminus region of pl5 gag 
10 just downstream of the gag to pol frameshift sequences. The variation in sequence 
length of the gag gene at these two positions is unusual. The homology of die encoded 
amino acid sequence of HTV-1 018^ and HIV NL4-3 for the gag pi 7, p24, and pl5 
proteins is 87.1%, 93.5% and 94.3%, respectively. 

IS In the pol ORF, the encoded proteins have high homology with the NL4-3 sequences 
being 95.5% overall comprising plO protease 92.9%, p66 reverse transcriptase 95.4% 
and p34 integrase 95.8%. The amino acid sequence of the p61 RT lacks the mutations 
nsfionfltH with resistance to the nucleoside (AZT, ddl, ddC)and non-nudeoside 
(Nevirapine) analogue drugs used in the treatment of HTV- 1 -infected persons. 

20 

The vjfgene encodes a 192 amino acid protein with 88.0% homology with that ofHTV-1 
KL4-3. The vpr gene encodes a 96 amino acid protein with 89.6% homology with that 
ofHIV-1 NL4-3. 

25 There are 2 insertions and 1 deletion of sequences in the vpu gene. The insertions of 
3 and 9 nucleotides are after die equivalent of nucleotide 6071 and 6234, respectively, 
of HTV-1 NL4-3. These add 1 amino add after amino acid 3, and 3 amino acids after 
amino acid 59 of the encoded Vpu protein. The deletion of 12 nucleotides after die 
equivalent of HTV-1 NL4-3 nudeotide 6261 deletes 4 amino acids from the C-terminal 

30 region of Vpu as well as from die signal peptide of the env poiyprotein, which is 
encoded by an overlapping reading frame. Amino acid sequence homology of HTV-1 
018^ Vpu with NL4-3 is 85*2%. 
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The sequence encoding the em gpl20 has 9 insertions totalling 45 nucleotides (encoding 
15 amino acids) and the deletion of a total of 18 nucleotides (encoding 6 amino acids). 
These are listed in Table 6. All of these events (insertions and deletions) are at positions 
in the env gene. This is within the env V3 coding region, immediately upstream of the 
5 sequence encoding the so called V3 tip (or loop) amino acid sequence, Gly Pro Oly Arg. 
The V3 region sequence is that of a typical clade B subtype (North America, Europe and 
Australia) being identical to the clade B consensus sequence (based on 186 env 
sequences) at 29/35 positions. The type of amino acid at positions 11 and 28 of be V3 
loop region (where position 1 is the Cys at amino acid 266 of the env gpl20) is 

10 predictive of the viral non-syncytimn / syncytium forming phenotype (Fouchier et al, 
1992). The HTV-1 Cli^Q env gene encoded amino acid sequence has Ser at position 
1 1 and lie at position 28 of the V3 loop region. The lack of a positively charged amino 
acid at both positions is strongly indicative of a non-syncytium viral phenotype. The 
overall amino acid sequence homology with HTV-1 NL4-3 (ignoring deletions and 

15 insertions) is 86.1%, comprising 85.5% for the gpl20 region and 87.6% for the gp41 
region. 

Both the tat and rev second exon open reading frames (ORF) axe longer than in HTV-1 
NL4-3. A change of the tor termination codon from TAG to TCG extends the tat ORF 
20 to a downstream in phase termination codon extending the encoded tat amino acid 
sequence by IS residues, compared with the 86 amino acid long NL4-3 tat protein, to 
a total length of 101 amino acids. However, tins is the usual length of the HIV-1 tat 
protein* 

25 Similarly, the normal rev termination codon is changed from TAG to GAG. This 
extends the rev ORF to an in-pbase termination codon 3 codons downstream so that the 
encoded Rev protein is 119 amino adds long instead of the usual 116. 

As mentioned above the most extensive differences between the sequences of the isolate 
30 HIV-1 C18 MB £ and HTV-1 NL4-3 are in the /u?/ gene and the LTR region. While the 
nefgisDS overlaps the 3* LTR, these differences are found in both the nef alone and the 
nef I LTR overlap regions. The HTV ClS^^-encoded nef protein is only 24 amino 
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acids long compared with the normal length of 206 amino acids. This severe shortening 
of the nef protein is due to the deletion of 188 nucleotides (the 177 and 1 1 nucleotide 
deletions) from the /ze^alone region which also brings into phase a termination codon, 
TAG, at the resulting 25th codon. Downstream there is further loss of potential nef gene 
5 sequences by the 120 and 87 nucleotide deletions situated in the nef I LTR overlap 
region. The resulting 24 amino acid nef protein is identical to the N-tenninus of the 
HIV-l NL4-3 nef at 9 of the first 10 positions. Thereafter, homology is lost completely. 

Some sequences used in the generation of mature mRNAs are altered or lost in C 1 &mbq- 

10 The dinucleotide immediately after the splice donor site 2 (SD2) at nts 48 18-481 9 (NL4- 
3 equivalent nts 4963-4964)is changed from the conserved GT to GC. It is expected 
that this change would lead to loss of function of this site as a splice donor. Splice 
donor 2 is used in the processing of HIV-l transcripts to some of the mRNAs that 
encode Tat, Rev and nef proteins. Similarly the splice acceptor site 7 (SA7) sequence 

IS at nts 6477-6478 (NL4-3 equivalent nts 6602-6603) is changed from the conserved AG 
dinucleotide to TC. This change is expected to lead to loss of function of this site as 
a splice acceptor. While this SA site is used in HIV-l mRNA processing it is not a 
major site and is not used in the production of the regulatory proteins (Tat, Rev or nef) 
mRNAs. The splice donor 12 site is absent from the €18^^ sequence (NL43 

20 equivalent nts 9161-9162) as it is within the first deletion region in the nef I LTR 
overlap region which occurs at nt 8797 and results in the loss of NL43 nucleotides 9105 
to 9224. It is significant that the SA12 site is absent from the sequence of all of the 
cohort virus isolates so far obtained as well as from the sequence of D36 PBMC, 
however, the C54 PBMC sequence does contain the SA12 she. SA12 is not used in the 

25 processing of mRNAs that encode the viral regulatory proteins. Normally SA12 is used 
in splicing in conjunction with SD1, 2, 3 and 4 and the resulting spliced RNA is 
probably not a mRNA but may have a regulatory role involving binding to cellular 
proteins (Smith et al, 1992). 

30 An interesting feature of the sequence of the HIV-l CI 8^^ isolate is the deletion and 
rearrangement of sequence from the 5'-LTR U3 region and the deletion of sequence 
from the nef gene (both nef alone and nef I V LTR regions). These being the only 
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features of the sequence distinct from disease-causing HTV-1. The lack of AIDS or 
AIDS-Iike symptoms in tire patient C18 is attributed to the effects of the loss of LTR 
sequence and / or the loss of nef coding sequences and their role in the pathogenesis of 
AIDS, 

5 

TABLE 4 

Primers used to Amplify Overlapping region* of HTV-1 ClSj^g^ 



10 




5'- 
OoordlMt* 


Dirtotion 
(♦/-> 




Primmr 0*qu«nc« 
Length (nt) 


15 


CL OA 


1 




30 




CL 14 


696 




25 


JUU'UvllLIASCTCCCll^TIUUX 




CL IB 


1 


♦ 


30 






CL 13 


796 




31 




20 


CL U 


682 


♦ 


23 


TCictcaxcsatoGftCTa^crT 




Or 18 


3440 




30 


CiUl 1 1 i LlbCCAbl ILlifaC i'LlWlTOl 1 




CL 12A 


732 


♦ 


26 




25 


CL 1? 


3330 




32 


£S£2^aiunTQCcaumcMmTcc£»c 


CL 26 


3133 




39 


cacAccsuGaawMAsaaauaMa^ 




CL CB 


9671 




39 




30 


CL 27 


3251 




39 




CL 28 


€39 




37 
■ 






CL 19 


5448 




30 






CL 24 


B422 




26 





35 

Underlined sequences depict added restriction enzyme site 

+ and - {mentations refer to sense and anttoense strands of Hie double stranded DNA 
sequence 
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TABLE 5 

Sequence Deletions and Insertions in H1V-1 018^^^. 
Compared with HTV-1 NL43 



10 



15 



20 



25 



30 



35 



Gene or Region 


Position 

now 


(nt) 
NL43 


Deletions 
(nt) 


Insertions 
(nt) 


5' -LTR TJ3 


29 


29 


120 




5' -LTR TO 


85 


205 


87 




5' -LTR TO 


154 


360 




9 




939 


1134 




15 




1982 


2163 




30 










c 
o 


vdzi 


9927 






3 




oU92 


6234 




9 


env . 


0128 


6261 


12 




env 


6483 


6628 




6 


G33V 


6514 


6653 


2 




env 


6524 


6665 


1 




env 


6630 


6772 




9 


env 


6646 


6778 




3 


env 


7011 


7141 


6 




env 


7140 


7276 


3 




env 


7195 


7334 




6 


env 


7266 


7399 


3 




env 


7278 


7414 




6 


env 


7290 


7420 




2 


env 


7300 


7429 




. 1 


env 


7314 


7441 


3 




env 


7463 


7593 




3 


env 


7471 


7598 




9 


net 


8711 


8829 


177 




nef 


8723 


9018 


11 




net / LTR 


8798 


9104 


120 




nef / LTR 


8854 


9280 


87 




Z»TR U3 


8923 


9435 







W09M1912 



PCT/AU95/D0063 



-66- 

EXAMPLE 13 

Macrophage Isolates ofHIV-1 CI 8 and MV-1 C98 

5 HRM has been isolated from the macrophages of patients C18 and C98. 

Patient monocytes were prepared as follows. Whole blood was spun at 2000rpm for 10 
minutes. Plasma was removed into a separate tube and the remaining cells were diluted 
1:2 in PBS" (magnesium and calcium free phosphate buffered saline). This was 
10 underlaid with 10 ml of Ficoil Isopaque and spun at 2000rpm for 20 minutes. Cells 
were collected from the interphase and washed three times with PBS". These cells were 
then seeded into a 6 well Costar tray at a concentration of 1.0 x 10 7 Anl and allowed to 
adhere for 1 hour. Any non-adherent cells were removed by aspiration. 

15 Donor HIV-1 negative macrophages for use in co-cultivation were prepared as follows. 
Peripheral blood mononuclear cells were purified from whole blood using 
FicoMsopaque density gradient These cells were seeded at a concentration of 2.0 x 
10 6 /ml in teflon* PBMC were cultured in the presence of 3pg/ml ofPHA and lOOOU/ml 
of M-CSF 3 days prior to co-culture. 

20 

On day of co-culture, donor PBMC were CD8 depleted Dyna beads coated with and- 
CD8 were used for this purpose. Dyna beads were washed once in PBS" and then 
applied to a magnet for 3 minutes. Supernatant was removed and the beads were then 
resuspended in 250/d of RF-10. Aliquots of 2.0 x 10 s patient cells were then added to 

25 250/J (3 beads ; 1 CDS T-cell) of Dyna beads and allowed to incubate for 30 minutes 
on ice with occasional mixing. After 30 minutes the cell suspension was placed onto 
a magnet for 3 minutes. Hie supernatant was then removed placed into a second tube 
containing 142 /ri (1 brad : 1 CD8 T-cell) of Dyna beads. This suspension was placed 
on ice for an additional 30 minutes with occasional mixing. After 30 minutes cell 

30 suspension was placed onto a magnet for 3 minutes. Supernatant was removed and 
washed once in RF-10. 
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For co-culture, CD8 depleted PBMC were then added to patient monocytes. Half media 
changes were done every 7 days for a period of 21 days. Aliquots of 2.5 ml of medium 
was removed from these cultures and replaced with CD8 depleted donor PBMC in 
Iscoves containing 10% HuS (Human serum), 5% v/v FCS and 5% w/v IL-2 and 
5 lOOOU/ml of M-CSF. Harvested supematants were spun at 1400 rpm for 10 minutes 
and stored as 1ml aliquots. Cell pellets were lysed in 200/xl of lysis buffer for PCR 
analysis. Infection was quantitated using a p24 EIA Kit. 

Cells were harvested from the co-cultures and used to prepare DN A as described above. 
10 The nefl 3'-LTR region of both virus isolates was amplified by PCR using the above 
described primer sets and conditions (Example 12). The resulting ampiimers were 
cloned into the plasmid vector pT7T3U19 and the nucleotide sequence determined by 
the Taq cycle sequencing method with dye-labelled primers. 

IS The C18 macrophage sequence has 3 deletions starting and finishing at positions within 
3 nucleotides of the same deletions in CIS^^ The encoded nef protein is 3 amino 
acids long compared with 7 amino acids for CI 8,^. The low homology region of the 
LTR U 3 region of CI 8 macrophage is very similar in sequence to CI 8;^ and 
similarly it has one extra upstream NFkB site. 

20 

On the other hand, the sequence of C98 macrophage has a number of differences from 
the C98 isolate. While it has exactly the same first deletion of 16 nucleotides just 
upstream of the potypurine tract (PPT), in the «e/-alone region, and exactly the same 
second deletion (position and size) it has an extra deletion of 18 nucleotides at NL4-3 
25 equivalent nucleotides 9206 to 9223. The final deletion is in approximately the same 
position as in the C98 isolate but is 5 nucleotides longer. The encoded wr/" protein is 34 
flminn acids long compared with 86 amino acids for the C98 isolate.' The low homology 
region is very similar to the C98 isolate, having the same 2 extra upstream NFkB sites 
and completely lacking the normal S'-NFkB site. 

30 
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EXAMPLE 14 

Construction and Use of an Infectious Molecular Clone 

5 Molecular biological techniques can be used to construct a molecular clone of, for 
example, HTV-1 CI 8^^. Two schemes may be used. In the first scheme genomic 
DNA, extracted from either the CD4 positive PBMC of the patient C18 or donor PBMC 
that have been infected with the isolate HTV-1 CI^q , is used as the template for 
polymerase chain reaction amplification, using thermostable polymerase of high 

10 transcriptional fidelity (eg UlTma polymerase or KlenTaq/Pfu polymerase mixture), of 
long (6 to 7 kb) overlapping fragments representing the 5'- and 3'-parts of the HIV-1 
c18 mbc proviral genome of total length 9207nts. The amplified fragments may then 
be ligated together after digestion with a restriction enzyme that cleaves at a unique site 
common to the overlapping region of the amplified fragments, for example the unique 

IS Bgl I or Nco I site. Ligation of this full length proviral DNA into a plasmid vector 
will allow its propagation in E coli and the subsequent preparation of large (mg) 
quantities of this molecularly cloned proviral DNA. 

In the second scheme donor PBMC that have been infected with the isolate HIV-1 
a source of non-integrated proviral DNA which can be extracted 
from the infected cells by the Hirt extraction method (Hirt, 1967). Circular proviral 
DNA molecules may be linearised by digestion with a restriction enzyme that cleaves 
at a unique position in the genome (eg the Bgl 1 or Nco I sites). The resitting linearised 
molecules can be ligated into a plasmid or, more usually, a bacteriophage lambda (X) 

25 based vector (eg Charon 4a, AWES) after modification of the end to provide blunt or 
cohesive ends compatible with the vector. Transformation or transduction of £ coli with 
the recombinant plasmid or bacteriophage material, respectively allows the propagation 
of the proviral DNA Clones of E coli containing proviral DNA may be selected and 
DNA prepared. Molecular clones of retroviral genomes prepared in this way are often 

30 permuted* Rearrangement to the functional arrangement of sequences is achieved by 
restriction engine cleavage and religation of fragments to reconstruct the correctly 
permuted proviral genome. 
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The molecularly cloned DNA products of both schemes can be used to prepare variant 
proviral genomes that may be used as the basis of a biologically attenuated HTV-1 
vaccine strain. Similarly, they may be modified to contain extra DNA sequences in the 
nef-dlone deletion region that may deliver sequences that may be of therapeutic 
5 advantage (eg antisense or ribozyme sequences). 

Infectious virus particles of HTV-1 Clfl^c , or modified virus, can be produced by 
transfection of human cells (eg HeLa cells) which will produce, and release to the 
culture medium, virus particles of HIV-1 018^^ , or modified virus. These virus 
10 particles can be used to infect a variety of CD4 positive cells for further propagation or 
experimentation. 

EXAMPLE 15 

In vivo Primate Model 

15 Following construction of infectious molecular clones of the mutant HIV-1 strains, 
studies are then undertaken in primates to establish attenuation, immunogenicity and 
vaccine prophylactic efficacy. All studies compare mutant clones of HIV-1 with 
isogenic wild-type (WT) virus. Initial studies are performed using the macaque (A£ 
nemistrina) model of HIV-1 infection. Macaque-infectious WT HTV-1 and mutant 

20 clones are compared with respect to duration of viremia, anatomic sites of replication, 
and cellular and humoral immune responses. Where the mutant HTV-1 clones induce 
an immune response in the macques infected, challenge studies with WT virus are also 
performed. Studies are performed in a limited number of chimpanzees, generally in 
parallel with the macaque studies* Relevant mutations are engineered into WT HIV-1 

25 clones previously shown to produce chronic infection in chimps, and the course of chimp 
infection with mutant clones compared with historical controls. If infection is 
established, WT challenge studies is also performed. 
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Those dolled in the art will appreciate that the invention described herein is susceptible 
to variations and modifications other than those specifically described It is to be 
understood that the invention includes all such variations and modifications. The 
5 invention also includes all of the steps, features, compositions and compounds referred 
to or indicated in this specification, individually or collectively, and any and all 
combinations of any two or more of said steps or features. 
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(2) INFORMATION FOR SEQ ID NO: I: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9709 btse pain 

(B) TYPE ncdcrc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(H) MOLECULE TYPE: UNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NOcl: 

TOOAAOGGCT AATTTOCJTCC CAAAAAA6AC AAGAGATCCT TGATCTGIOT AXCTACCACA 60 

CACAAGGCIA CTTCCCTGAT TGGCAGAACT AC&CACCM3 GCCAGQGATC AGAXATCCAC 120 

TOACOTTGG ATGGTOOTC AAGRAGTAC CAGRGAACC AGAGCAAGXA GAAGAGGOCA 180 

AAXAAGGAGA GAAGAACAGC TIGTtAOlOC CtAIGAGCCA GCA TBG GA ro GAGGAOOOGG 240 

AGGGAGAAGT AXTAGXGTGG AAGTTTGACA GCOCCIAGC ATTTCQXCAC A3SGOCCGAB 300 

AGCXGCKXCC GGAGTACXXC AAAGACSSCT GACAimAOC TTTCXACAAG O ttAC T mUi M0. 

CXOOOC2IICTT tCCKOOQAOQ lUTGOCCTOG 605GGA CT OS GGAG1GGC2A OCOCKaOAT 420 

QCXACAXAXA AOCMCTOCT TTTTOCCtOT ACXQOSICIC tCTGGTTAGA COkOAJCXQA 400 

GCCTBGGAGC TCXCTGGCEA ACZAGGGAAC LUUUGCl'lA AflCCTCAAIA AAGCTIGCCT 540 

TOAGTOCTCA AAOXV7IGTO TOC COOTCI O TOIGTCACT OSGTAACIA OK0KTOCCTC 600 

agaccctttt JumaoTOT u OMMaercr agcagtggcg cccgaacaqg gacxtgaaag ceo 

OSAAAGIRAA GOCAGAGGAG AICICTCBAC OCMQACTOO GCTTOCTGAA QGQGBGACGG 730 

CAAGAGGCGA GG GGO0G CSA CXGGTGAGTA CGCCAAAAAX TTTOACTAGC GOAGGCEAGA 780 

AGGAGAGAGA 7GGGXGCGAG AGCGTOGGTA IXAAGG3GGG CnGAAItAGA TAAATGGGAA 840 

AAAAXTCBGT TAAGGCCKW OOGAAAOAXA CAAXAXAAAC IMMCUXT AGTAXGGGCA 900 

XGC&GGGAGC TAGAACGATT CGCAOTaMT CC T GaOjm TAGAGACAIC AGAAGOClQr 960 

AGACAAATAC TGGGACACCI ACAAOTICC CTTCAGACAG GATCAGAA6A AQTAGATCA 1030 

TTAEftXAAIA GAAZAGCAGT OC lCTKriUT CIW C A T CA AA GGUaUUTOT AAAAGACAOC 1080 

AAGGAASGCT IAGAXAAGKX AGAGaAAGAG CAAAATAAA* GXMOIIMA OGCACAGCAA 1140 

BCTGCaGCTG ATACAflflAAA CAAOGCCM aTCAGCCAAA MIlOCCCAr AGTGCAGAAC 1200 

CXCCaSGGGC AAATQGTACA TCAQGGOLTA TCACCEAGAA CRCAAKXGC A T GGG T AAAA 1260 

GBtiXAGAAG AQAAGGCTIT CAGCCCASA& GXAAXA0CC& TGTTTTCBGC ATTATGAOAA 1320 

GGAOCCACCC CACAAGATTT AAXXBCCAXG CCAAACACAG TGB GGQGAr A TCAAGCAOCC 1300 

ATGCAAATGT TAAAAGAGAC CKTaUOGAG GAA3CTOCA3 AATGGGAEAG XRGCATGCA 1440 

GTGCA73CAG GGCCXAlIUfC AOCAGGCCAG AXaAGAOAAC CMGGGGAAG T6ACAXA0CA 1500 

GGAACTACTA GTACCCITCA GGAACAAATA GSATG&HGA CACAXAA3CC ACCEATCCCA 1560 

GTAGGAGAAA TCXAXAAAAG AXGGAXAKIC C2GGGKR91A ASAAAAXAGT AAGAAXBTA3 1430 

AUJULTACCA GCaTXCTGGA CATAARAHIA GGACCAAAGG AACCCFTOG AQA1.TAIULA 1600 

GAOOBftTTCr AXAAAACTCT AAGAGCCGAQ CAAGC1TCAC AAQAGQXAAA AAATTGGATG 1740. 

ACAOAAACCT WrJUO T OCA AAATOCGAAC CCAflATTOTA AGACDHTTT AAAAGCAXXG 1100 

GGAOQU3GA0 C3ACACXAGA AGAAATGAXG ACAGCATGTC AGGGAGTGGG OGQACCOOOC 1060 

rmftAOCftA GAGTTRQGC TOAAGCAATC AGCCAAGIAA CAAATCCAGC tACCATAATQ 1920 

AZACAOAAAa OCAAlTTXAa OAACCMAOA AAfltACTlTTTA AGTGTTTCAA TTGTOOCAAA 1900 

GAASSGCACA T9G02AAAA TTGCAGGGCC GCEAS3AAA& AOGOCTOTTG aAAATCTOGA 2040 

AAOGAAOOAC AOCAAATGAA AGATTOZACT GAGAGACAGG C T A A I lXjll AGGGAAGATC 2100 

XGGCCnOCC ACAAGGGAAG OCCAGfafSAAT TTTCTTCAGA GCAGACQHGA GOGUCAGCC 2160 

COCCCAOA3W JUSAGCTTCAG GTROOQ3AA GAGATAATAA CTCCCTCTCA BAAGGAGGAG 2220 

CCGXSKGACA JGOAACTGn TCCTTXIGCr TOCCTOtOAt CACTCmOQ CAQCGACQCC - 22S0 
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TCGTOLCAAT AAAGATAGGG GGGCAATTAA AGGAAGCTCT ATTAGATACA OGDUSCMaTO 2340 

ATACAGTATT AGAAGAAATO AATTTGCCM OAAGATGOAA ACCAAAAATG ATA0GGOQAA 2400 

TTGGAGGTTT TATCAAAGTA GGACA3TATG ATGAOATACT CATAOAAATC TQCGGACATA 2460 

AAGCTATAGG TACAOTATTA CTAGGACCrA CACCTffTCAA CAEAATTGGA AGAAATCTGT 2520 

lOACTCAGAT TOGCTGCACT TTAAAITTTC CCATIAGTCC TATtGAGACT GfTACCAOTAA 2580 

AATTAAAGCC AGGAATGGAT GGCCCAAAAC TTAAAGAKXG GCCATTCSACA GAAGAAAAAA 2(40 

TAAAAGCSKXT AGTAGAAATT TGTACAGAAA TGGAAAAGGA AGGAAAAA1? TCAAAAATTO 2700 

GGCCTGAAAA TCCATACAAT ACTCCAGTAT 1TCCCATAAA GAAAAAAGAC AGTACXAAAX 2760 

GGAGAAAATT AOIAGATTTC AGAGAACTTA ATAAGAGAAC TGAAGAXTTC TOOCAACTTC 2B20 

AAITAGCAAT ACCACATCCT OXAGGOTTAA AACAQAAAAA ATCAGTAACA GTACTGGATQ 2680 

TBGGCGATOC ATATTTTTCA GTTQOCTTAO AIAAAGACTT CAGQAAOTAT ACTGCATTTA 2940 

CCATAOdAG TATAAACAAT GAGACACCAG GGWXTAGAXA TCAGTACAAT GTGCXTCG&C 3000 

AGGGATGGAA AGGATCACCA GCAAIATTCC AGTGTAOCXr OAGAAAAATC TZAGAGOCTT 3060 

TCAGAAAACA AAATCCAGAC AlAOTCATCT ATCAATACAT CXWTGATTTB TATGTAGGAT 2120 

CIQACTTAGA AATAGGGCAG CAXAGAAC&A AAA3KSU3GA ACTOAGaCaA GVTCTfiTTGA 3190 

OOTOOGOATT TMCA&CCA OACAAAAAAC ATCAGAAAGA AOCTCC A TTC CtrtGOATOa 3240 

GTTATQAACT CCATCCTOAT MXXOCUQa XACAGOCIAT AOTGCTGCCA GAAAAGGACA 3200 

GCIGGACTBT CSOTQACXXA CAGAAATTAG TGGGAAAATT GAATTGGGCA AGTCAGATTT 3340 

ATGCACGGAT TAAAGTAAGG CAATTAMTA AACTTCTTAO OGGAftCCAAA OCACTAACW 3420 

AAQTAGXACC ACXAACAGAA GAAGQflAOC TMAACIGGC AOAAAACAGO GAGATTCTAA 3410 

AAGAACCGCT ACA1GGAGTQ IKTIXXGACC OtTCAAAAGA CTTAAIAGCA GAAASACAGA 3540 

AGCAGGOGCA AGOCCAATGG ACATATCAAA TTTATCAAGA OCOtlTTTAAA AAICTOAAAA 3600 

CAGGAAAAZA TGCAAGAATG AAGGGTGCCC ACACTAATGA TGTGAAACAA TtAACAGAGG 3660 

CAGTACAAAA AKMCCACA GAAAGCATA6 TAAXATQGGG AAAQACTOCT AAATTTAAAT 3720 

TACCCATACA AAAGGAAACA TGGGAA3CAT OOTCWACAta GTATTOGCAA G CLA CCTQ GA 3780 

TTCCTGAGTG GGAGTTTGTC AATACCCCTC CCTTAGTGAA GTtAKOtAC CA3TTAGAGA 3940 

AAGAAOCCAr AATAGGAGCA GAAACTTTCT ATOTAGATGG GGCAGCCAAT AGGGAAACIA 3900 

AATTAGGAAA AGOGGATAT GTAACTOACA GAGGAAGACA AAAAGTTGTC CCOCTAACQQ 3960 

ATACAACAAA TCMAAGACr GAGTTACAAG CAATKATCT AGCTTTGCAG GATTO0GGAT 4020 

lAGAAGTAAA CATAGTQACA GACTCACAAT ATaCAXtOGG AAICTTTCAA GCACAACCAG 4080 

ATAAGAG7QA ATCAGAGTXA OTCAOTCAAA tAAIAGAGCA GTEAATAAAA AA GGAAA A AG 4140 

TCTAOC3GOC AXGGGTAOCA GCACACAAAG GAATTOGAGG AAATGAACAA GXAGATGGGT 4200 

TGGTCAGTCC TGGAATCAGO AMGtACT*! TTTfAGATQG AAXAGAtAAG QCCCAAGAAO 4260 

AACATGAOAA AIATCACAOT AATTOGAGAG CAATGGCTAG TGATTTTAAC CTAa'ACCIQ 4320 

TAGTAGGAAA AGAAATAGTA 000000X0 A3AAAOTTCA GCTAAAAGGQ GAAfiCCKXGC 4310 

AZOGACAAGT AGACTOTAGC CCAGGAAXAT GGCAGCTAQA TTGTACACAT TTAGAAGGAA 4440 

AAGTTATCTT OGTAOCAOTT OKOTAOCCA GXGGATAXAT AGAAGCAGAA OTAATTCCAO 4500 

C5W3AGACAGG GCAAQAAACA OCATM.TIU; TCTTAAAATT AGCAGGAAGA 7QQ0CAGXAA 4560 

AAACAGTACA TACAGACAAT QOCAGCAATT TCACCaGXAC TACAGTTAAG GCOSCCXOTr 4620 

GBTGGGCBGO GATCAAGCAG GAATTTGOCA TTCCCTACAA TCCOCAAAGT CAAGGAGTAA 4680 

TAOAATCTAT GAAXAAAGAA TTAAAGAAAA TTATAGGACA GGTAAGAGA9 CAGOCIOAAC 4740 

ATCTTAAGAC AGGAGTACAA ATOGCAGXAT TCATOCACAA TTTTAAAAGA AA ASGO GOG A 4800 

TTCJGGGGGTA CAGTGCAGGG GAAAGAATAG TAGACAtAAT AGCAACAGAC ATACAAACIA 4860 

AAGAATTACA AAAACAAATT ACAAAAATXC AAAATTTTOa GGTTTXTTAC AGGGACAGCA 4920 

GAGATOCAaT TIOGAAAGGA CCAGCAAAGC TCCTCTGGAA AGQTGAAGGG GCAGTAGTAA 4960 

TACAAGAtAA TAGTGACATA AAAGTAGTGC CAAGAAGAAA AGCAAAGATC ATCAGGGATT 5040 

A30GAAAACA GjKXGOOQQFI GUGATIQIO 'TGOCaAGTAG ACAGOAXOAG GATTAACACA S100 
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TGOAAAAGAX TAGTAAAACA CCAMGEAT ATTTOttGGA AAOCZAAOaA CTOTTT1TAT 5160 

AflACATCACT A2GAAA0TAC TAAXCttAAA AJAACTTCAO AAfftftCACRT CCCACtAQW 5220 

GAT6CTAAAT TAGTAATAAC AACATATTGG OdTCICCATA CACGAOAAW3 AGACTQCCAT 5200 

TTQOOTCAGa OAGTCTOC&T AGAATGGAGO AAAAAGAGAT ATAOCACACA ACTAOACCCT 5340 

GACCTAGCAG AOCAACTAAT TCAICIOCAC TKCTTTtSMT GTTTTTCAGA ATCrGCTAIA 5400 

AfiAAATACCA TATTAGGACB TATASTOGT CCTAOTCTS AATAJCAAaC ASQAttXAAC 5460 

AAGGXAGGAT CTCIACAGTA CTTSGC2*CTA GCAGCATTAA TAAAACCAAA ACAOAXAAAQ 5520 

CCACCTTTOC CXAOXQTTfta GAAACXGACA GA3GACAGAT GGAACAAGCC CCAOAACACC 5560 

AAaOGCCACA OAQOQACCCA 1ACAATGAAT OQACACXM0A GOTTIAGAG GAACTXAABA 5640 

GTGAAGCTCT TAGACATTTT CCTAGGATAX GGCTCCATAA CTTAGOACAA CATATCIATG 5700 

AABCTXACQQ GGATACTTOO OCaUUAUTUU AAOCCATAAT AAGAATTCTG CAAGAACTGC S760 

TCTTtATCCA TTXCAQAftTT OGGTOTCGAC ATOSatGAAr AGGOGTXACT CBACAgAOGA 5820 

GAGCAAOAAA TOGAOCOW3T AQtfCCTAGA CEAGACCCCT OOAAaCATCC AGQAAGTCAO 5860 

CCTAAAACIG CllUiAOGAA TTGCTA1T0T AAAAAGTGTT GCTTTCATTG CCAAOtTTOT 5940 

TTCATBACAA AAGCCTIAGB CAT C rCCIAT GGCAQGAAGA AGGQOA&ACA GCQACSAAGA 6000 

GCTULTCAQA ACAGTCAGAC TCATCAAGCT TCTCIATCAA AGGASIAASX AGZAGATCHA 6060 

xsaa&cax taaxmtzagc aatacixh3CA teasxagzag caataazaat aocaatagtt 6 120 

O lQl gg iC CA TAGTAATCAT AaAAXKXAOO AAAAXATXAA GAOtAAOAAA AA2AGAGAGG 6180 

TD^MTOATA GftCXAATAGA AAGAGCTSAA GftCMKGGJCA A7SAGRGTGA AOGAQAAQTA 6240 

TCAGCACTTO TGOAGKXOGG GGTOGAAATG GCJGCACCATG ClllTlU^UA TATTaATOAt 6300 

CTtrTAGTTCT ACAOAAAAAt TQT GGG TC A C AOTCXATTAT GGGGTAOCTG TGIGGAAGGA 6360 

AGGAACGACC A CIf T A TTlT GTOCATaKIA TQCXAAAQCA TATGATACAG AOCITACAtAA 6430 

l lf l TlUUUC L' ACACATGCCT GIGZACCCAC AOACCCCAAC OCACAAGAAG TAOIATTGGT 6480 

AAATCTTOACA GAAAATITTA ACATGTGOAA AAATQftCATG GTAOAACAGA TGCATOAGGA 6S40 

TATAATCAOT TtATGGGATC AAAGCCTAAA GCCXXGTGXA AAATTAAGCC CACTCXOTGT 6600 

rAGTTTAAAO TCCACTGATT TCAAGAATOA TACEAATACC AATAGTAGTA GEBQBNSffit 6660 

GAXAATGGAG AAAOQAQAOA lAAAAAACXa CTCTTTCAAT AXCM3CACAA GCATAAGAGA 6720 

TAAGGTGCAG AAAOAATATQ CATTCTTTTA TAAACXIQAT AIAGTACCAA TAGATAAXftC 6780 

CAQCTAXAQG TTGAXAAGJTT GTAACACCTC AGXGATTAai CA93CC3PGXC CAAAGGTATC 6840 

CTTTGA9CCA ATTOOCATAC ATTATJXjIUC COCOGCTGQT TXT309AT7C 3AAAAIBTAA 6800 

TAAXAAOAOa TTCAATQSAA CAOOACCATa tACUATGTC AOaUMJTAC AA3BZACACA 6960 

TSGAATCAGG COflTAOTAT CAACTCAACX OCTGTTAAAT OCCACTCTAO CAGAAfiAAGA 7020 

ItJTWnAATT AQ A TCT GOCA ACTTCACASA CAATGOAAA ACCATAATAO TACAOTGAA 7080 

CACATCiaiA QAAATTAATT GXACAASAOC CAAOACAAT ACAAOAAAAA QtA TCCOlA T 7X40 

OCAQAGGOOA CCAOTOAGAG CKTTTGXXAC AATAQQAAAA A3AQQAAAIA TQAQACAAGC 7200 

AOOTCTAAC ATTAOXAOAG CAAAATOSAA TGCCACTTTA AAACAQAIAOJ CTASCAAAXT 7260 

AASAGAAGAA TTXaOAAftZA ATAAAACAAT AKtCITTAAG OWICCKM OROOOQACQC 7220 

XSAAATTGTA ACBOlCAaTT nAAllViCiUU AOOGGAATTT TTCIACIQXA ATTCAACACA 73 §0 

AL'ltrl'lTAAT AGXACTtOOT RAA3AGT&C TTGOAGZACT GAA3QGTCAA AIAACACTGA 7440 

AGGAAOXGAC ACAAICACAC T0OCATGCAO AATAAAACAA TTXATAAACA TOT03CAOGA 7500 

AGtAGGAAAA OCAAXtRATS OCCCTO CC AT CAOTGOACAA ATTAGATCTT CATCaAATAT 7560 

TACrooocro cearaacaa aAOArooroo taaxmoac a a t ggot cc s agatcttcao 7620 

ACCTOOAGGA aOOQATATGA SSGACAATTa OASAAOIOAA TTAZAZAAAT AXAAAGTAOT 7680 

AAAAAXTOAA CCATTADGAQ tAOCACCCAC CAAOKAAAO AaAAOACTGC tmUBAQAOA 7740 

AAAAAGAOCA GIGGQAAXAO GAOCTTTCTT iAA ' LUUUm TT fW B ABTA ff CAOTA3CAC 7800 

1ATOOSCZGC ACGTCAATBA CQCIQAOGOT ACAGGGOOA CAARAXTQT CSQAXAlAflT 7660 

GCTGQQCAfi AAOMRtGC tGAOSQCXAT 1QAOGC8CAA CAGCATC3GT TGCAACTCAC 7»0 
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AGTCTGGGGC ATCWUCAGC TCCAGOCAAO AATCCTOGCT OTGOAAAOAT ACCEAAAOGA 7960 

TCXACAGCTC CtGOOOKTrr OUUUTiaTC TGGAAAACTC ATRGCAQCA ITCCICT CC C 8040 

RG0&A1GCT AQTIQGAOTA ATAA AI C T CT GGAACAGATT TGGAATAACA TGACCTGGAT 6100 

CXSAtSIQQSAC AOAGAAATTA ACAATTACAC AAGCTXAAXA COCtCCTM TTBMGAATC 8160 

aCAAAACCAO CAAOAAAAOA AIOAACAAOA Ar EA TTO ftA A TXROAIAAAX GOGCAAGTTT 8320 

GTGOAATXOO TTIAACAXAA CAAAXTGOCT GTQQTAXAtA AAAXTRTTGA TAATGATAGT 8280 

AOOftsacno otaogtttaa gaataotttt toctotacit tctaiagtga aiagagttag 8340 

QCAGOGATAT TCACGATTAT OOTTTCAGAC CCACCICOCA ATCCOOAOOG OACCCGACAG 6400 

GCCOGAAflGA AXAOAAGAAO AAGSTGGAOA OAOAOACACa OACAOATOCA TTCSATtAOT 8460 

QMcaanxoc togga/ctta tctggoacoa tctocooagc ctgigcctct tcagctacca ssao 

ccocttoaoa gacttactct toaitctaac gaogattgxg gaacwcxoo OAcacAOooa ssso 

oroooAAacc ctcxaatatt ooxogaaxct cctacactat TaaaarcAaa aactaaagaa smo 

TAQTGCTOTT AACRGCXOh AJSC O C A fl C CAtftQCAOTA OCTGAGGGGA CAGA2A0Q0T G700 

TATAOAACTJl TTACMflCAO CTTATAGAGC TATTOGCCAC ATACCTAGAA OAATWU3ACA 6760 

O00CTT0GAA AGGATTTTOC TATAAGATQQ GTGGOttGTG OTCAAAAAGT AEJIGTQATTQ 8830 

GMGGGCTOC TOTAAGGGAA AGAATQAOAC OAOCTOAOCC AQGMKH3AT 00001000X0 8860 

CACTATCTOO AOACCXAGAA AAACATOOM CAAXCACAAQ TAOCAAXACA CCAGCiaACA 8940 

ATOcroaro taccroaca GAAoatcAAQ Mcaomwa uurauomr ccagotcac 9000 

CTOkOaiXCC TTXAAOACCA AIUAL1TACA AGGCAGCXGT AOATCTTAOC CACTITIIAA 9060 

AAOAAAAOOQ qOOACTOQAA GGGCZUttC AttCCCAAAO AAOACAAOAT ATCCTTGATC 9X30 

TCTOSATCIA CCACACACAA OOCZACTXCC CTQATXaOCA OAACXACACA CCAGGGCCAO 9180 

QQGTCAGAZ&, TCCACTOACC 1 1 1UUATQC3T OCTACRAGCr AOTACCXOTT OAOCCfiflATA 9240 

AOGXAQAAOA OOCCAAlfcAA GGAGAGAACA CCAGCTTCTT ACACCCTOW AOCCIGCATG 9300 

aAATOOATOA CCCTGAGAGA OAAOTOTtAO AOiaOAOOTT TOACAGCOGC CTAGCATTTC 9340 

ATCAOQTQQC CCGAGAGCTO GKTOCQOMT AdTCMOAA CXQCXOACAX OGAGCTTCCT 9420 

ACAAOaa AC T TT C O e cr s aa OlOTTCCAO GOAGOCGTGO OCTOOOOOSG ACTQSGGAflT 9460 

OOOOAGCCCr CABAWCTOC ATATAAOCAO HUH ' illlU C r C TCTC IGG 9540 

TXAOACGMIA, TCXGAOCCTO OOMCXCXCT OaCXAACXM OOAACCCACT OCTTAACOCT 9600 

CAATAAACCT T CO CT KU UW OCTTCAAOOfc OTOTOTOCCC OTCXXRTSXa I UA CTCIOST 966D 

AACTAGAflAT OCCrCAOACC CTTTTAOTCA GTOTGOAAAA TCTCTACCA 9709 
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(2) INFORMATION FOR SEQ ID N02: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE* nacktetcid 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: 



(ID MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ JD NO* 
ATCGGTGGCA 

(2) INFORMATION FOR SEQ ID NOS: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nnckfc add 

(Q STRANDEDNESS: (ingle 
(D) TOPOLOGY: 



<H) MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ IDN03: 
TGGGIGGGAA 

(2) INFORMATION FOR SEQ ID NO* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 
<xl) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
GGGTGGCAAG 

(2) INFORMATION FOR SEQ ID NOd: 

© SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pain 

(B) TYPE: nudetetcM 

(C) STRANDEDNESS: ffasgte 

(D) TOPCHjOGY: Ibeer 

(H) MOLECULE TYPE: DNA 
OcD SEQUENCE DESCRIPTION: SEQ ID N05: 
GGTGGCAAGT 

CZ) INFORMATION FOR SEQ ID NO£: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bnscptin 

(B) TYPE: nnddc acid 

(C) STRANDEDNESS: sin^e 



(D) TOPOLOGY: linear 
(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:6: 
QTCGCAAGTO 

<2) INFORMATION FOR SEQ ID NO:7: 

CD SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 
(B) TYPE: nucleic acid 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: Ibear 

<ii) MOLECULE TYPE: DNA 

<n) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

TGGCAAGIGG 

(2) INFORMATION FOR SEQ ID Na& 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE ondcie add 

(C) STRANDEDNESS: ibgk 

(D) TOPOLOGY: linear 

CD MOLECULE TYPE* DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO£: 
GGCAAGTGGT 

(2) INFORMATION FOR SEQ ID NO& 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mrfclc arid 

(C) STRANDEDNESS: angle 

(D) TOPOLOGY: 



(S) MOLECULE TYPE: UNA 
(*D SEQUENCE DESCRIPTION: SEQ ID NO& 
GCAAGTCGTC 

(2) INFORMATION FOR SEQ ID NChKh 

© SBQUQKX CHARACTERISTICS: 
CA) LENGTH: 10b«»ptte 

(B) TYPE: nscfefe add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(zi) SEQUENCE DESCRIPTION: SEQ IDKO-.10: 
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CAAOTGOTCA 

(2) INFORMATION FOR SEQ ID NO: 11: 

0 SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudele odd 

(C) STRANDEDNESS: ilngte 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AAGTGGTCAA 

(2) INFORMATION FOR SEQ ID N0:12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObajcpoin 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
P) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

AOTOGTCAAA 

(2) INFORMATION FOR SEQ ID NO:I3: 

<Q SEQUENCE CHARACTERISTICS: 

(A) LENGT1H: 10 base paia 

(B) TOTE: nucleic add 

(Q STRANDEDNESS: single 
03) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:13: 

OTGGTCAAAA 

W INFORMATION FOR SEQ ID NO:14: 

© SEQUENCE CHARACTERISTICS: 

(A) LENC7IR 10 b«c paia 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ID MOLECULE TYPE SNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:M: 

TQOTCAAAAA 

(2) INFORMATION FOR SEQ ID NO:I3: 

0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base (udn 
(P)TYPE:inicWc«dd 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 1 5: 
GGTCAAAAAG 

(2) INFORMATION FOR SEQ ID NO:16: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic tcid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 'ID NO: 16: 

GTCAAAAAOT 

(2) INFORMATION FOR SEQ ID NO:17: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 7: 

TCAAAAAGTA 

(2) INFORMATION FOR SEQ ID NO: 18: 

00 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 esse pain 
(0) TYPE: nucleic acid 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQlDNOrlS: 

CAAAAAGTAG 

(2) INFORMATION FOR SEQ ID NO-.19: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENC7IH: 10 base pairs 

(B) TYPE: nuckic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DMA 

CsD SEQUENCE DESCRIPTION: SEQ IDNO:!* 

AAAAAGTA0T 
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(2) INFORMATION FOR SEQ ID NO20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bascpdn 

(B) TYPE: audeic acM 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: Eacar 

(H) MOLECULE TYPE* DNA 

W SEQUENCE DESCRIPTION: SBQ ID NO20: 

AAAAGTAGTO 

(2) INFORMATION FOR SBQ JD NQ21: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- lObasepsbs 

(B) TYPE: nucleic tcid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: 1 



© MOLECULE TYPE: DNA 
Cd) SEQUENCE DESCRIPTION: SEQ ID N021: 
AAAGTAGTGT 

(2) INFORMATION FOR SEQ ID N022: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pais 

(B) TYPE: nuddc add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N022: 
AAGTAGTGTG 

CZ) INFORMATION FOR SEQIDN03J: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 be* pairs 

(B) TYPE; nocfcfe add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: 



(H) MOLECULE TYPE: DNA 
(d) SEQUENCE DESCRIPTION: SEQIDN023: 
AGTAGTGTGA 

(Z) INFORMATION FOR SEQ ID N024: 

fl) SEQUENCE CHARACTERISTICS: 
(A) LENGTH 10 base pairs 
TO TYPE: nodelc add 
<Q STRANDEDNESS: dogfe 
(D) TOPOLOGY: 



(D) MOLECULE TYPE: DNA 
04 SEQUENCE DESCRIPTION: SEQ IDN024: 
GTAGTCIX5AT 

(2) INFORMATION FOR SEQ ID N025 : 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: tmcldc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(S) MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ KDN025: 
TAGTOTGATT 

CZ) INFORMATION FOR SEQ ID N02& 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: midelc add 

(Q STRANDEDNESS: ilogte 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIFnON: SEQ ID NO-26: 

AGTCTGATTG 

0) INFORMATION FOR SEQ ID N027: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH lObasepaUi 

(B) TYPE* nudeicactd 

(C) STRANDEDNESS: lingle 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: UNA 
(xQ SEQUENCE DESCRIPTTON: SEQIDN027: 
UlUiliATTtSG 

(2) INFORMATION FOR SEQ ID NOdfc 

(D SEQUENCE CHARACIkKJSTlCS: 
(A) LENGTH lObanpaxn 
TO TYPE: nuddc add 
(C) STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SBQ ID N028: 

TGTCATIGGA 

(2) INFORMATION FOR SEQ ID N029: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(9) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N029: 

GTOATTGGAT 

(2) INFORMATION FOR SEQ ID NO30: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 baas pairs 

(B) TYPE: oneiric add 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: ttncir 

(D) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO JO: 
TGATTGGATC 

(2) INFORMATION FOR SEQ ID NO"Jl: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 hue pain 

(B) TflPE; nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



MOLECULE TYPE: DNA 
(id) SEQUENCE DESCRIPTION: SEQ ID NO*JI: 
GATTGGATGG 

(2) INFORMATION FOR SEQ IDN032: 

(0 SEQUENCE CHARACIERtSTTCS: 
(A) LENGTH: 10 base pm 
0B) TYPE: nucleic add 
(C) STRANDEDNESS: tingle 
CD) TOPOLOGY: 



(U) MOLECULE TYPE: DNA 
Cal) SEQUENCE DESCRIPTION: SEQ ID N032: 
ATTQGATGGC 

(2) INFORMATION FOR SEQ ID N033: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N033: 
TTGGATGGCC 

(2) INFORMATION FOR SEQ ID NOJ4: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pant 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
«) SEQUENCE DESCRIPTION: SEQ ID N034: 
TGGATOGCCT 

(2) INFORMATION FOR SEQ ID NO:35: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudtxc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDND35: 
GGATGGCCTG 

(2) INFORMATION FOR SEQ ID N036: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: imgfc 

(D) TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N05* 
GATGGCCTGC 

<2) INFORMATION FOR SEQ ID N037: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(O STRANDEDNESS: stogie 
(D) TOPOWGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N037: 

ATGGCCTOCT 

(2) INFORMATION FOR SEQ ID N03« : 
(I) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: lOb&sepsa 

(B) TYPE: uudeteicid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

fci) SEQUENCE DESCRIPTION: SEQ IDN038: 

TGGCCTGCTO 

(2) INFORMATION FOR5EQ ID NO-J9: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 10 base pairs 

(B) TYPE: mtcfeic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bneer 

(H) MOLECULE TYPE; DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:39: 
CGCCTOCTGT 

<2) INFORMATION FOR SEQ ID NO:40: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE; nudde acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 
(xi> SEQUENCE DESCRIPTION: SEQ ID NOsUh 
OCCTGCTGTA 

(2) INFORMATION FOR SEQ ID NOrtl: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nocfefc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(3d) SEQUENCE DESCRIPTION: SEQ ID NO*!: 
CCTCCTOIAA 

(2) INFORMATION FOR SEQ IDNOtfl- 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 btse pahs 

(B) TYPE: mtddc add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

(kQ SEQUENCE DESCRIPTION: SEQ ID NOtf2: 
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CTGCIGTAAO 

(2) INFORMATION FOR SEQ ID NO*3: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudde add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO*3: 
TGCTCTAAGG 

(2) INFORMATION FOR SEQ IDNO*4: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nndele acid 

(C) STRANDEDNESS: single 

(D) TOPCuUXTY: fines* 

(H) MOLECULE TYPE: DNA 
tf) SEQUENCE DESCRIPTION: SEQ ID NO:44: 
GCTGTAAGGG 

(2) INFORMATION FOR SEQ ID NO:45: 

(0 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 
(B)TYPE: onddc add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<1I) MOLECULE TYPE: DNA 

(3d) SEQUENCE DESCRIPTION: SEQ ID NCM5: 

CTGTAAGGGA 

(2) INFORMATION FOR. SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: msdeic acid 

(C) STRANDEDNESS: single 

(D) TX»<)LOGY: linear 

(8) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NOtf& 
TGTAAGGGAA 

<2) INFORMATION FOR SEQ ID NO*7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nudde add 
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(Q STRANDEDNESS; single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GTAAGGGAAA 

(2) INFORMATION FOR SEQ ID NO*fc 

O SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Unev 

(il) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
TAAGGGAAAG 

CO INFORMATION FOR SEQ IDNO** 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bra pun 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO** 
AAGGGAAAGA 

0) INFORMATION FOR SEQ ID NO:50: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: lObasepafas 
<B) TYPE: nudeic acid 
<Q STRANDEDNESS: stogie 
(D) TOPOLOGY: Hcev 

fri) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDNO:50: 

AQQGAAAGAA 

(2) INFORMATION FOR SEQ ID NfeSl: 

0) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 bra pain 

(B) TYPE: nudeie aczd 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: Bnear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N031: 

GGGAAAGAAT 



(2) INFORMATION FOR SEQ ID N0J2: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ IDNO:52: 

GGAAAGAATG 

<2) INFORMATION FOR SEQ ID NO:53: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

{Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ 1DN053: 

GAAAGAATGA 

(2) INFORMATION FOR SEQ ID N034: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(jd) SEQUENCE DESCRIPTION: SEQ ID NG.54; 
AAAGAATGAG 

(2) INFORMATION FOR SEQ ID N033: 

© SEQUENCE CHARACIERBTICS: 
(A) LENGTH: 10 base pain 
' O) TYPE: nndefc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cd) MOLECULE TYPE: DNA v 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO:55: 
AAGAATGAGA 

(2) INFORMATION FOR SEQ ID N036: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nodek add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Bnear 
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(E) MOLECULE TYPE: DNA 
<xi) SEQUENCE DESCRIPTION: SEQ ID NOS6: 
AGAATGAGAC 

(2) INFORMATION FOR SEQ ID NO:57: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nnctcic acid 

(Q STRANDEDNESS: single 
93) TOPOLOGY: linear 

(0) MOLECULE TYPE; DNA 

(xQ SEQUENCE DESCRIPTION: SEQIDNO-J7: 

GAATOAGACG 

(2) INFORMATION FOR SEQ ID NO£8: 

© SEQUENCE CHARACTERISTICS: 

(A) LENOTH: 10 base pates 

(B) TYPE: micldcadd 

(Q STRANDEDNESS: (logic 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N058: 

AATQAGAOQA 

(2) INFORMATION FOR SEQ ID NOSfc 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Eoeaf 

00 MOLECULE TYPE: DNA 
Cri) SEQUENCE DESCRIPTION: SEQ ID N039: 
ATCAGACGAG 

(2) INFORMATION FOR SEQ ID NOS0: 

(I) SEQUENCE CHARACTERISTTCS: 

(A) LENGTH: 10 baa pain 

(B) TYPE: nuddc acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNChfiO: 
TGAGACGAGC 

(2) INFORMATION FOR SEQ ID NO£l: 



(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE* rmeteteadd 

(Q STRANDEDNESS: stogie 
(D) TOPOLOGY: fiaear 

(H) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:6l: 

GAGACGAGCT 

(2) INFORMATION FOR SEQ ID NO*2: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pain 

(B) TYPE: nwtele acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear v 

(H) MOLECULE TYPE: DNA 
(zi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 
AGACGAGCTG 

(2) INFORMATION FOR SEQ ID NO*3 : 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 
(id) SEQUENCE DESCRIPTION: SEQ lDNO£3: 
GACQA0CTQA 

(2) INFORMATION FOR SEQ ID NO*4: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGIH: 10 base pairs 
(P) TYPE: nucleic add 
(Q STRANDEDNESS: aingle 
(D) TOPOLOGY; linear 

00 MOLECULE TYPE: DNA 

(xJ> SEQUENCE DESCRIPTION: SEQIDNOtf4: 

ACGAGCTCAG 

<2) INFORMATION FOR SEQ ID NOrfS : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudefc add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: ttaear 

(H) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:65: 
CGAGCTGAGC 

(2) INFORMATION FOR SEQ ID NO:66: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pita? 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE; DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO*6: 
GAGCTGAGCC 

(2) INFORMATION FOR SEQ ID NO£7: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nodefe add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNOrf7: 
AGCTGAGCCA 

(2) INFORMATION FOR SEQ IDNO** 

Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(Ii) MOLECULE TYPE. DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NOrffc 

GCIGAGGCAG 

(2) INFORMATION FOR SEQ ID NO:69: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nodefe add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(D) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:69: 
CTQAGCCAGC 

(2) INFORMATION FOR SEQ ID NfeTO: 

00 SEQUENCE CHARACTERISTICS: 



(A) LENGTH* 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO:70: 
TGAGCCAGCA 

<2) INFORMATION FOR SEQ ID NO:71: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuddc add 

(C) STRANDEDNrSS: single 

(D) TOPOLOGY: tear 

(Ii) MOLECULE TYPE DNA 
<xQ SEQUENCE DESCRIPTION: SEQ ID NOtf 1: 
GAOCCAGCAG 

(2) INFORMATION FOR SEQ IDN032: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nucleic add 

CQ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

<*J) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

AGCCATCAGC 

(2) INFORMATION FOR SEQ ID NO:73: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nodctc add 

CO STRANDH3NESS: single 
(D) TOPOLOGY: finear 

(H) MOLECULE TYPE: DNA 

(if) SEQUENCE DESCRIPTION: SEQ ID NO:73: 

GCCAGCAGCA 

CZ) INFORMATION FOR SEQ ID N034: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pahs 
O) TYPE: nucleic add 
(Q STRANDEDNESS: tiftgEe 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
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CCAGCAGCAG 

(2) INFORMATION FOR SEQ ID NO:75: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE* mtdde add 

(Q STRANDEDNESS: singe 
CD) TOPOLOGY: Linear 

(il) MOLECULE TYPE: DNA 

(id) SEQUENCE DESCRIPTION: SEQ ID NOtfS: 

CAOCAGCAQA 

Q) INFORMATION FOR SEQ ID NO:76: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: lingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO:76: 

AGCAGCAGAT 

(Z) INFORMATION FOR SEQ ID NO:77: 

CO SEQUENCE CHARACTERISTICS; 

(A) LENGTH* 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<® MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ IDNOr77: 

GCAGCAGATG 

(2) INFORMATION FOR SEQ ID NOSfc 

(Q SEQUENCE CHARAClhKlSilCS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: msddc acid 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(x$ SEQUENCE DESCRIPTION: SEQ IDN038: 
CAGCAGATGG 

INFORMATION FOR SEQ ID >Kfc79: 

(0 SEQUBTCE CHARACTERISTICS: ' 
(A) LENGTH: 10 base pain 
03} TYPE: nnctetc acid 



(C) STRANDEDNESS: single 
CD) TOPOLOGY: 



<p) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 
AGCAGATGGG 

(2) INFORMATION FOR SEQ IDNO:S0: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: fiaetr 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ IDNOS0: 

GCAGATGGGG 

(2) INFORMATION FOR SEQ ID NO* 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE; DNA 
(xf) SEQUENCE DESCRIPTION: SEQ IDNOdtl: 
CAGATGGGGT 

(2) INFORMATION FOR SEQ ID NO-.S2: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Boetr 

(H> MOLECULE TYPE DNA 
Op) SEQUENCE DESCRIPTION: SEQ ID NO-J2: 
AGATGGGGTG 

(2> INFORMATION FOR SEQ ID NOJ3 : 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pahs 
' (B) TYPE: mdeic add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: Dnear 

00 MOLECULE TYPE: DNA 

£□) SEQUENCE DESCRIPTION: SEQ ID NO£3: 

GATGGGCTCG 
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(2) INFORMATION FOR SEQ ID NO:84: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic tcid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: I 



ffi) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO:S4: 
ATGOQOTGGG 

(Z) INFORMATION FOR SEQO>NO:S5: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

0) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:85: 

TGGGGTOGGA 

(?) INFORMATION FOR SEQ ID NO*6: 

G) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 tee pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fincir 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:86: 
GGGOTGGGAG 

(2) INFORMATION FOR SEQ ID NOS7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singte 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: UNA 
(zi) SEQUENCE DESCRIPTION: SEQIDNO:87: 
GGGTOGGAGC 

(2) INFORMATION FOR SEQ ID NOA8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE nucleic add 

(Q STRANDEDNESS: linglo 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8S; 
GGTGGGAGCA 

^ INFORMATION FOR SEQ ID NO; 89: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base sain 

(B) TYPE* nucleic add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE* DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO** 

GTGGGAGCAG 

(?) INFORMATION FOR SEQ ID NO50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE* nucleic add 

(O STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 

<Xi) SEQUENCE DESOUFTION: SEQ ID NO30: 
TGGGAGCAGT 

(2) INFORMATION FOR SEQ ID NO:91: 

(0 SEQUBTCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE middc acid 

(C) STRANDEDNESS: tingle 
CD) TOPOLOGY: fine* 

(II) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N031: 

GGGAGCAGTA 

(2) INFORMATION FOR SEQ ID NOS2: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nockfc add 

(P) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO£2: 
GGAGCAGTAT 

(2) INFORMATION FOR SEQ ID NO£3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mtdek add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 
0d) SEQUENCE DESCRIPTION: SEQIDN093: 
GAGCAGTATC 

(2) INFORMATION FOR SEQ ID NOS4: 

® SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY; linear 

(II) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN034: 

AGCAGTATCT 

(2) INFORMATION FOR SEQ ID NOS3: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pairs 

(B) TYPE: nuddc acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:95: 
OCAGTATCTC 

(2) INFORMATION FOR SEQ ID N03& 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENOIR lObesepaus 

(B) TYPE: nudeicadd 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

W> SEQUENCE DESCRIPTION: SEQ IDN036: 

CAGTATCTCG 

(Z) INFORMATION FOR SEQ ID NO£7: 

© SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
09} TYPE: nndefeadd 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: Doear 

(5) MOLECULE TYPE: DNA 



Qd) SEQUENCE DESCRIPTION: SEQ IDNOS7: 
AGTATCTCGA 

(2) INFORMATION FOR SEQ ID NO08: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

0i) MOLECULE TYPE: DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ IDN058: 
GTATCTCGAG 

(2) INFORMATION FOR SEQ ID NOS9: 

(l) SEQUENCE CHARACTERISTICS: . 
(A) LENGTH: 10 base pain 
(D) TYPE: mdeifi add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA ' 

fjrf) SEQUENCE DESCRIPTION: SEQ IDN059: 

TATCTCGAGA 

(2) INFORMATION FOR SEQ ID NO. 100: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudeicadd 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: rmear 

Cd) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO100: 

ATCTCGAGAC 

(2) INFORMATION FOR SEQ ID NO101: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

(Q) MOLECULE TYPE: DNA 
(*Q SEQUENCE DESCRIPTION: SEQ ID NOrlOl: 
ICTCGAOACC 

(2) INFORMATION FOR SEQ ID NO:102: 
CD SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pain 

(B) TYPE nudeic odd 

(C) STRANDEDNESS: single 
p) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NO102: 

CTOGAGACCT 

(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudeic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO: 103: 
TCGAGACCTA 

(3) INFORMATION FOR SEQ ID NO:104: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
CB)TYPE nucleic acid 
CQ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(□> MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:104: 

COAGACCTAG 

(2> INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) UENGTH: 10 base pain 

(B) TYP£ midelcadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE' DNA 
(kQ SEQUENCE DESCRIPTION: SEQ ID NCfclOJ: 
GAGACCTAGA 

(2) INFORMATION FOR SEQ ID NO-.10& 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nucleic add 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

fu) MOLECULE TYPE DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO106: 



91 - 

AGACCTAGAA 

(2) INFORMATION FOR SEQ ID NO: 107: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE nbclelc acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(B) MOLECULE TYPE DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO:107: 
GACCTAGAAA 

(2) INFORMATION FOR SEQ ID NO:10fc 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudeic add 

(Q STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

09MOI8CULE TYPE DNA 

Cd> SEQUENCE DESCRIPTION: SEQ ID NO:10& 

ACCTAGAAAA 

(2) INFORMATION FOR SEQ ID NO: 109: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nudeic acid 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:10* 

CCTAGAAAAA 

(2) INFORMATION FOR SEQ ID NO:l 10: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObase 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 
CTAGAAAAAC 

(2) INFORMATION FOR SEQ D> mill: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic acid 
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(O STRANDEDNESS: single 
<D) TOPOLOGY; Untax 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNOrlll: 

TAGAAAAACA 

(2) INFORMATION FOR SEQ ID NO: 1 12; 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi> SEQUENCE DESCRIPTION: SEQ ID NO:112: 
AGAAAAACAT 

(2) INFORMATION FOR SBQ ID NO: 113: 

O SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(n) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 

GAAAAACATO 

C2) INFORMATION FOR SEQ ID NO:l 14: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuclele add 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(Jd) SEQUENCE DESCRIPTION: SEQIDNO:114: 

AAAAACATGG 

(2) INFORMATION FOR SEQ ID NO:! IS: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pain 

(B) TYPE: flsdefe add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Iloear 

fp"> MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTOR: SEQ ID NO: 115: 

AAAACATGGA 



(2) INFORMATION FOR SEQ ID NO:il6: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: fintar 

(fl) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 116: 

AAACATOGAG 

(2) INFORMATION FOR SEQ ID NO:l 17: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucWc acid 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: 



(B) MOLECULE TYPE: DNA 
fxD SEQUENCE DESCRIPTION: SEQ ID NO:117: 
AACATGGAGC 

(2) INFORMATION FOR SEQ ID NO:lIS: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 
STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
£ri) SEQUENCE DESCRIPTION: SEQ ID NOU18: 
ACATGGAGCA 

(2) INFORMATION FOR SEQ ID NO: 11 9: 

0) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 
.(B) TYPE: nucleic add 
(Q STRANDEDNESS: single 
£D) TOPOLOGY: finesr 

00 MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQIDNO:119: 

CATOGAGCAA 

f2) INFORMATION FOR SEQ ID NO: 120: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: rmdeio acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 1 
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(E) MOLECULE TYPE; DNA 
(u) SEQUENCE DESCRIPTION: SEQ ID NO:120: 
ATGGAGCAAT 

(2) INFORMATION FOR SEQ ID NO:121: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 hue pan 

(B) TYPE: oadolc acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 
TOGAGCAAIC 

(?) INFORMATION FOR SEQ ID NO: 122: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObnepnrt 

(B) TYFE: nuekieackl 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122: 

GQAOCAATCA 

(2) INFORMATION FOR SEQ ID NO: 123: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic kU 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Hour 

(u) MOLECULE TYPE: DNA 

(zi) SEQUENCE DESCRIPTION: SEQ ID NO:123: 

OAGCAATCAC 

fJZ) INFORMATION FOR SEQ ID NO:124: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPB: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xD SEQUENCE DESCRIPTION: SEQ ID NO:124: 
AGCAATCACA 

(2) INFORMATION FOR SEQ ID NO:125: 



{Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
P) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125: 

GCAATCACAA 

(2) INFORMATION FOR SEQ ID NO:126: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 126: 
CAATCACAAO 

(2) INFORMATION FOR SEQ ID NO: 127: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: micfcio add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(Q) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127: 

AATCACAAGT 

C2) INFORMATION FOR SEQ ID NO:123: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
0) TYPE: irackfc add 
' (Q STRANDEDNESS: single 
(0) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(XQ SEQUENCE DESCRIPTION: SEQ ID NO:I28: 

ATCACAAGTA 

(2) INFORMATION FOR SEQ ID NO: 129: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 bate pairs 
03) TYPE: micWc acid 
(O STRANDEDNESS: tingle 
0>) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
TCACAAGTAG 

(2) INFORMATION FOR SEQ ID NCU30: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 fane 

(B) TYPE xradeic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:I30: 
CACAAGTAGC 

(2) INFORMATION FOR SEQ ID NO:13 1: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pitas 

(B) TYPE: nucleic sod 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: 



<U) MOLECULE TYPE: DNA 
<xl) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 
ACAAGTAGCA 

(2) INFORMATION FOR SEQ ID NO: 1 32: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(TO TOPOLOGY; linear 

0D MOLECULE TYPE: DNA 

<xl) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

CAAGTAGCAA 

(2) INFORMATION FOR SEQ ID NO: 113: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:I33: 
AAGTAGCAAT 

(2) INFORMATION FOR SEQ ID NO: 134: 

(i) SEQUENCE GIARACIERISTICS: 



(A) LENGTH: 10 bate pain 

(B) TYPE- micick acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
{xi} SEQUENCE DESOTPTCON: SEQ ID NO: 134: 
AGTAGCAATA 

C2) INFORMATION FOR SEQ ID NO:135: 

(0 SEQUENCE CHARACIBUSTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

© MOLECULE TYPE* DNA 
fti) SEQUENCE DESCRIPTION: SEQ ID NO:135: 
GTAGCAATAC 

^ INFORMATION FOR SEQ ID NO:13fc 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 
TAGCAATACA 

(2) INFORMATION FOR SEQ ID NQ:137; 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pairs 

(B) TYPE: nnddc add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

fri) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 
AGCAATACAG 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObtsepafas 
<B) TYPE: raddc add 
(C) STRANDEDNESS: tingle 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

0d) SEQUENCE DESCRIPTION: SBQ IDNO:138: 
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GCAATACAGC 

(2) INFORMATION FOR SEQ ID NO:139: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mickk add 

(Q STRANDEDNESS: single 
CP) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(ti) SEQUENCE DESCRIPTION: SEQ ID NO: 139: 

CAATACAGCA 

(2) INFORMATION FOR SEQ IDNOI40: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nwkio acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID NO: 140: 

AATACAGCAO 

(2) INFORMATION FOR SEQ ID NO:141: 

fl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: middc add 

(Q STRANDEDNESS: i ingle 
P) TOPOLOGY: linear 

gQ MOLECULE TYPE: DNA 

W) SEQUENCE DESCRIPTION: SEQ ID NO:141: 

ATACAGCAGC 

(2) INFORMATION FOR SEQ ID NO: 142: 

CO SEQUENCE CHARACTERISTICS: 

(A) LBK7IH: 10 base pafas 

(B) TYPE: onolele add 

<Q STRANDEDNESS: ilngje 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

d0 SEQUENCE INSCRIPTION: SEQ ID NO: 142: 

TACAOCAOCr 

(2) INFORMATION FOR SEQ ID NO. 143: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObafepabx 

(B) TYFE: nadfilo acid 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ft) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143: 

ACAOCAGCTA 

(2) INFORMATION FOR SEQ ID N0144: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: croddc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

0$ SEQUENCE DESCRIPTION: SEQ ID NO:144: 

CAGCA0CTAA 

(2) INFORMATION FOR SEQ ID ND:I45: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pafas 

(B) TYPE: nocteit add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 
(xJ> SEQUENCE DESCRIPTION: SEQ ID NO:)45: 
AGCAGCTAAC 

(2) INFORMATION FOR SEQ ID NO:14fc 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base palm 
(P) TYPE: onsfcfe add 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: finear 

(10 MOIECULE TYPE: DNA 
(u) SEQUENCE DESCRIPTION: SEQ ID N0146: 
GCAGCTAACA 

(2) INFORMATION FOR SEQ ID NO: 147: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bass pafas 

(B) TYPE: nucteie add 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: Smear . 

00 MOLECULE TYPE: DNA 

CcQ SEQUENCE DESCRIPTION: SEQ ID N0147: 

CAGCTAACAA 
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(2) INFORMATION FOR SEQ ID NO: 148: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base nabs 

(B) TYPE: nucleic add 

(C) 51RANDEDNESS: single 

(D) TOPOLOGY: Hneer 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:148: 
AGCTAACAAT 

(2) INFORMATION FOR SEQ IDNO:I4fc 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic add 

CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0Q MOLECULE TYPE: DNA 

(jd) SEQUENCE DESCRIPTION: SEQ ID NO: 149: 

GCTAACAATG 

(2) INFORMATION FOR SEQ ID Nai5(h 

(I) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 bin pain 
<B) TYPE nucleic add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(w) SEQUENCE DESCRIPTION: SEQ ID NO:15Cfc 

CTAACAATGC 

(3) INFORMATION FOR SEQ ID NOilSI: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: m rrfric add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(S) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NO:131: 

TAACAATGCT 

C2) INFORMATION FOR SBQ IDN0452: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGIH: lOtasepeta 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152: 
AACAATGCTG 

(2) INFORMATION FOR SEQ ID NO:I53: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: stngte 
(D) TOPOLOGY: Enear 

00 MOLECULE TYPE: DNA 

(jd} SEQUENCE DESCRIPTION: SEQ ID NO 1 53: 

ACAATCCTGC 

CZ) INFORMATION FOR SEQ ID NO: 1 54: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: mtcleicadd 

(O STRANDEDNESS: single 
(D) TOPOLOGY: Enear 

(H) MOLECULE TYPE' DNA 

fcd) SEQUENCE DESCRIPTION: SEQ ID NO:154: 

CAATGCIGCT 

(2) INFORMATION FOR SEQ ID NO:155: 



(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic add 

<Q STRANDEDNESS: single 
(D) TOPOLOGY: toear 

<n) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO:155: 

AATGCIQCTT 

(Z> INFORMATION FOR SEQ ID NO: 3 56: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: noctetc add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ TD NO: 156: 

ATGCIGCTPG 

(2) INFORMATION FOR SEQ ID NO: 1 57: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: rmdete acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linen 

fl) MOLECULE TYPE; DNA 

M SEQUENCE DESCRIPTION: SEQ ID NO: 157: 

TGCTGCTTGT 

(2) INFORMATION FDR SEQ ID NO: 158: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObtsepain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N&158: 
GCTGCTIGTG 

C2) INFORMATION FOR SEQ ID NO: 159: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(P) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fu) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:159: 

CTGCTTCTGC 

C2) INFORMATION FOR SEQ ID NO:160: 

Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE nocleto acid 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: ttnear 

00 MOLECULE TYPE: DNA 

Cd> SEQUENCE DESCRIPTION: SEQ ID NO:160: 

TGCITCTCCC 

(2) INFORMATION FOR SEQ ID NO: 1 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 (use pan 

(B) TYPE: nscleie acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

. Ql) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16l: 
GCTTGTGCCT 

(2) INFORMATION FOR SEQ ID NO: 162: 

(J) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(19 MOLECULE TYPE: DNA 

(u) SEQUENCE DESCRIPTION: SEQ ID N0:162: 

CI'IUIOCCTG 

(2) INFORMATION FOR SEQ ID N&163: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYFE nucleic acid 

(C) STRANDEDNESS: single 
(P) TOPOLOGY: linear 

(D) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:163: 

TTGTGCCTGG 

(2) INFORMATION FOR SEQ ID NO;164: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base nabs 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(fl) MOLECULE TYPE" DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 164: 
TGTOCCTGGC 

(2) INFORK^ATION FOR SEQ ID NO: 165: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE DNA 

. (xi) SEQTJENCE DESCRIPTION: SEQ ID NO: 1 65: 

GTGCCIGGCT 

(2) INFORMATION FOR SEQ ID NO.166: 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGIH: 10 base pahs 

(B) TYPE' nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 1 



(a) MOLECULE TYPE: DNA 
<xi) SEQUENCE DESCRIPTION: SBQ ID NO:166: 
TGOCRXKTA 

(2) INFORMATION FOR SEQ ID NO:167: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: noddc add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: 



(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 
GCCTGGCIXG 

(2) INFORMATION FOR SEQ ID NO:168: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE' nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: 



(ii) MOLECULE TYPE: UNA 
Od) SEQUENCE DESCRIPTION: SEQ ID N0468: 
CCTGGCTAGA 

(2) INFORMATION FOR SEQ IDNO:169: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic and 

(O STRANDEDNESS: single 
(D) TOPOLOGY: Bnear 

(D) MOLECULE TYPE: DNA 

frQ SEQUENCE DESCRIPTION: SEQ ID NCfcl69: 

CTOGCTAGAA 

(2) INFORMATION FOR SEQ ID NO:170c 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYFE: nndefcacid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fincar 

00 MOLECULE TYPE: DNA 

0$ SEQUENCE DESCRIPTION: SEQ IDNO:170: 
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TGGCTAGAAQ 

(2) INFORMATION FOR SEQ ID NO: 171; 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

<C) STRANDEDNESS: single 
<D)'TOPOLOGY: linear 

(ft) MOLECULE TYPE: DNA 

(n) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

GGCTAGAAGC 

(2) INFORMATION FOR SEQ ID NO: 1 72: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pafas 

(B) TYPE: nudric add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

(U) MOLECULE TYPE: DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID NCfc272: 
GCTAQAAGCA 

(2) INFORMATION FOR SEQ ID NO: 173: 

(E) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

(fl) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 173: 
CTAGAAGCAC 

(2) INFORMATION FOR SEQ ID NCH74: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nsdele acid 

(C) STUANDEDNESS: stage 

(D) TOPOLOGY: Kmear 

(D) MOLECULE TYPE: DNA 
Op) SEQUENCE DESCRIPTION: SEQ ID NCM74: 
TAGAAGCACA 

CZ) INFORMATION FOR SEQ ID NO: 175: 

{i)SEQUDTCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nnrinicarirf 
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(Q STRANDEDNESS; single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 

W) SEQUENCE DESCRIPTION: SEQ ID Nai75: 

AGAACCACAA 

(2) INFORMATION FOR SEQ ID NO: 1 76: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS; single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176: 

GAAGCACAAG 

(2) INFORMATION FOR SEQ ID NO:I77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE; DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:I77: 

AAOCACAAGA 

(Z) INFORMATION FOR SEQ ID NO:17fc 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: tO base pass 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ£DNO:178: 
AGCACAAGAO 

(2) INFORMATION FOR SEQ ID NO:i79: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: abgle 

(D) TOPOLOGY: Unear 

(H) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:179: 
QCACAAQAGO 



(2) INFORMATION FOR SEQ ID MfclUk 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: ISO: 

CACAAGAGGA 

(2) INFORMATION FOR SEQ ID NO:! 81: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE; nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Co) MOLECULE TYPE* DNA 

(xi) SEQUENCE DESCRIPTION: SEQE>NO:lll: 

ACAAGAOGAG 

(2) INFORMATION FOR SEQ ID NO:l*2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 taw pairs 

(B) TYPE nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(fl) MOLECULE TYPE DNA 
{*D SEQUENCE DESCRIPTION: SEQ ID NO:182: 
CAAGAGGAGG 

(2) INFORMATION FOR. SEQ ID N0:183: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepaizs 

(B) TYPE nodetc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA - 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:183: 
AAGAGGAOQA 

(2) INFORMATION FOR SEQ ID NO:lS4: 

(i) SEQUENCE OIARACIERISTtCS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nnckac add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: i 
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(B) MOLECULE TYPE: DNA 
00) SEQUENCE DESCRIPTION: SEQ ID NO;lS4; 
AGACOAGGAA 

(2) INFORMATION FOR SEQ ID NChlM: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepda 

(B) TYPE nudeic add 

(Q STRANDEDNESS: dngfe 
(D) TOPOLOGY: Knear 

(5) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO:185: 

GAGGAOGAAQ 

(2) INFORMATION FOR SEQ ID NO:18& 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base paha 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

tf) MOLECULE TYPE: DNA 

(ad) SEQUENCE DESCRIPTION: SEQ ID NO:I86: 

AGGAGGAAGA 

(2) INFORMATION FOR SEQ ID NO:187: 

© SEQUENCE CHARACi BUS I ICS: 

(A) LENGTH: 10 base pals 

(B) TYPE: nucleic add 

<Q STRANDEDNESS: single 

(D) TOPOLOGY: fincar 

(0) MOLECULE TYPE: UNA 
(*0 SEQUENCE DESCRIPTION: SEQ ID NO:lS7: 
GGAGGAAGAG 

(2) INFORMATTON FOR SEQ ID NO&lftfc 

© SEQUENCE CHARACTERISTICS; 

(A) LENGTH: lObasopein 

(B) TYPE: nudeic add 

(R STRANDEDNESS: linglt 
(D) TOPOLOGY: fineer 

CO MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO:l 88: 

GAGGAAGAGG 

(2) INFORMATION FOR SEQ ID NM89: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 
(a) SEQUENCE DESCRIPTION: SEQ ID NO:l«9: 
AGGAAGAGGT 

(2) INFORMATION FOR SEQ ID NO: 190: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: naddc add 

(€) STRANDEDNESS: etngfe 
(D) TOPOLOGY: linear 

(ti) MOLECULE TYPE DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NChl90: 

GGAAGAGGTG 

(Z) INFORMATION FOR SEQ ID NO: 191: 

(I) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
0) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY; linear 

(U) MOLECULE TYPE: DNA 
(n) SEQUENCE DESCRIPTION: SEQ ID NO:191: 
GAAGAGGTGG 

(2) INFORMATION FOR SEQ ID NO:192: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGIH; 10 base pma 

(B) TYPE: nudeie add 

<Q STRANDEDNESS: single 
QD) TOPOLOGY; linear 

(H) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

AAGAGGTGGG 

(2) INFORMATION FOR SEQ ID NO: 193: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pdri 

(B) TYPE: nudeic add 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: 

(D) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 
AGAGGTGQOT 

(2) INFORMATION FOR SEQ ID NO: 194: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: mickie add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:194: 

GAGGTGGGTT 

(Z) INFORMATION FOR SEQ ID NO:195: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bra pairs 

(B) TYPE: nudeb odd 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Linear 

(fl) MOLECULE TYPE: UNA 

(zl) SEQUENCE DESCRIPTION: SEQ ID NO:195: 

AGCjIGGGITT 

(2) INFORMATION FOR SEQ ID NO: 196: 

(I) SEQUQICE CHARACTERISTICS: 

(A) LENGTH: 10 hue pairs 

(B) TYPE imcldoacid 

<C) STRANDEDNESS: tingle 
(D) TOPOLOGY: tincnr 

(U) MOLECULE TYPE: UNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID N0196: 

GGTGGGTTTT 

© INFORMATION FOR SEQ ID N0197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bus pitas 

(B) TYPE: nuddcarid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQIDNO:W7: 

GTGGGTnTC 

(2) INFORMATION FOR SEQ ID N019S: 

(1) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 toe pate 

(B) TYPE: nudeie add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fit) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:198: 
TGGGTTTTDC 

(2) INFORMATION FOR SEQ ID NO:199: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:199: 

GGGmTCCA 

(2) INFORMATION FOR SEQ ID NO200: 

(O SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mideie add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDNO200: 

GGTTTTCCAG 

(2) INFORMATION FOR SEQ ID NO201 : 

(1) SEQUQ4CE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: auddo add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQIDNO301: 
OTmCCAOT 

(2) INFORMATION FOR SEQ 0> NO202: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: sndde add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

£d) SEQUENCE DESCRIPTION: SEQ IDNCh202: 
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TTTTCCAGTC 

(2) INFORMATION FOR SEQ ID NO203: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 lose pairs 

(B) TYPE: tmddc idd 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linetr 

00 MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ IDNO203; 

TTTCCAGTCA 

(2) INFORMATION FOR SEQ ID NO204: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: SDckteesld 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: 



(11) MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQ IDNO&04: 
TTCCAGTCAC 

C2) INFORMATION FOR SEQ ED NO:205: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE; nuddc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 

Gri) SEQUENCE DESCRIPTION: SEQ ID NO205: 

TCCAGICACA 

(2) INFORMATION FOR SEQ ID NO206: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pass 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

fci) SEQUENCE DESCRIPTION: SEQ ID NOaOfc 

CCAGTCACAC 

£) INFORMATION FOR SEQ ID NQ207: 

$ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: amebic add 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO207: 
CAGTCACACC 

(2) INFORMATION FOR SEQ ID NO209: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: alngle 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(*J) SEQUENCE DESCRIPTION: SEQIDNQ208; 
AGTCACACCT 

(2) INFORMATION FOR SEQ ID NO309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS; single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(ti) SEQUENCE DESCRIPTION: SEQ ID NO209: 

GTCACACCTC 

a) INFORMATION FOR SEQ ID N02IO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pafat 

(B) TYPE: nadele acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: finear 

GO MOLECULE TYPE: DNA 

(3d) SEQUENCE DESCRIPTION: SEQ ID NO210: 

TCACACCTCA 

(2) INFORMATION FOR SEQ ID NOSH: 

© SEQUENCE CHARAC1KK1STICS: 

(A) LENGTH: 10 base pain 

(B) TYPE- gndric acid 

CQ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

- (19 MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N02U: 

CACACCICAG 
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(2) INFORMATION FOR SEQ ID N02I2: 

(I) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 10 base pairs 

(B) TYFE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

fl) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0212: 
ACACCTCAGG 

(2) INFORMATION FOR SEQ ID N0213: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pairs 

(B) TYPE: trackic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

00) SEQUENCE DESCRIPTION: SEQ ID N0213: 

CACCTCAOGT 

(2) INFORMATION FOR SEQ ID N0214: 

© SEQUENCE CHARACTERISTICS: 

(A) iENGTH; 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
Cri) SEQUENCE DESCRIPTION: SEQ ID N0214: 
ACCTCAGGTA 

(2) INFORMATION FOR SEQ ID N021S: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 ban past 

(B) TYPE: nudete add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<W MOLECULE TYPE: DNA 
(n) SEQUENCE DESCRIPTION: SEQ ID NOS15: 
CCICAGCTAC 

(2) INFORMATION FOR SEQ ID N0216: 

0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
(g) TYPE: nnddc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(B) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ O N0216: 
CTCAGGTACC 

(2) INFORMATION FOR SEQ tD NO:217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:217: 

TCAGGTACCT 

(2) INFORMATION FOR SEQ ID NO:218; 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nnddcadd 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Bnear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N02I8: 
CAGGTACCIT 

(2) INFORMATION FOR SEQ ID N0219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

01) MOLECULE TYPE DNA 

(kQ SEQUENCE DESCRIPTION: SEQ ID N0219: . 

AGGTACcrrr 

(2) INFORMATION FOR SEQ ID NO220: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Bnear 

fil) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-.220: 

GGTACCITTA 

(2) INFORMATION FOR SEQ ID N02Z1: 
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0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOboepnis 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

(U> MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N0321: 

GTAOCTTTAA 

(2) INFORMATION FOR SEQ ID N0222: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 tare pairs 

(B) TYPE: mxdek add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Uneir 

(H) MOLECULE TYPE: DNA 
(id) SEQUENCE DESCRIPTION: SEQ ID N0222 : 
TACCTTTAAG 

(2) INFORMATION FOR SEQ ID N0223: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: micfete acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(U) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID N0323: 
ACCITTAAGA 

(2) INFORMATION FOR SEQ ID N0324: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENC7IH: 10 base pairs 

(B) TYPE: nnclcic add 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: 1 



C3) MOLECULE TYPE' DNA 
(ri) SEQUOICE DESCRIPTION: SEQ mN0224: 
CCnTAAGAC 

(2) INFORMATION FOR SEQ ID N0225: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH' 10 base pais 

(B) TYPE: tmrielc ftdd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 1 

(8) MOLECULE TYPE: DNA 



W SEQUENCE ASCRIPTION: SEQ ID NO 225: 
CITTAAGACC 

(2) INFORMATION FOR SEQ ID N0326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic arid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



0) MOLECULE TYPE: DNA 
<M) SEQUENCE DESCRIPTION: SEQ ID N0226: 
TTTAAGACCA 

(2> INFORMATION FOR SEQ ID N0227: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOtaepaifs 

(B) TYPE: nodelfl add 

(Q STRANDEDNEK: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

fri) SEQUENCE DESCRIPTION: SEQIDN0227: 

TTAAQACCAA 

(2) INFORMATION FOR SEQ ID N022* 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic arid 

CP) STRANDEDNESS: single 
CD) TOPOLOGY: Uneir 

(H) MOLECULE TYPE: DNA 

fri> SEQUENCE DESCRIPTION: SEQ ID N022& 

TAAGACCAAT 

(2) INFORMATION FOR SEQ ID N0229: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bete pain 

(B) TYPE: ouctejc tcid 

CO STRANDEDNESS: single 
(D) TOPOLOGY: fioear 

CB) MOLECULE TYPE UNA 

0ri) SEQUENCE DESCRIPTION: SEQ ID N0229: 

AAOACCAATG 

C2) INFORMATION FOR SEQ ID NO230: 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH* 10 base pairs 

(B) TYPE: nucWc add 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE- DMA 

fci) SEQUENCE DESCRIPTION: SEQ ID NO230; 

AGACCAATGA 

Q) INFORMATION FOR SEQ ID N0231: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pun 

(B) TYPE: nucleic add 

(C) STRANDEDNESS; angle 
(D> TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION; SEQ ID NQ231: 

QACCAATQAC 

<2) INFORMATION FOR SEQ ID N0232: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: nuddo add 

<Q STRANDEDNESS: singlo 

(D) TOPOLOGY: Uneir 

00 MOLECULE TYPE DNA 
(xfy SEQUENCE DESCRIPTION: SEQ ID NO£32: 
ACCAATQACT 

(2) INFORMATION FOR SEQ ID NQ033: 

0) SEQUENCE CHARACTERISTICS: 
OA) LENGTH: 10 base pair* 
(B)TYPE: nudeic acid 
(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N0233: 

CCAATGACTT 

C2) INFORMATION FOR SEQ ID N0434: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Qncar 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ IDNQ234: 
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CAATGACTTA 

m INFORMATION FOR SEQ ID N0235: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N0235: 

AATGACTTAC 

(2) INFORMATION FOR SEQ IDN0336 

0} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudeic add 

(Q STRANDEDNESS: single 
<p) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(X0 SEQUENCE DESCRIPTION: SEQ IDN0236: 

ATGACTTACA 

(2) INFORMATION FOR SEQ ID N0237: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: niddc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fincar 

(n) MOLECULE TYPE: DNA 
(X0 SEQUENCE DESCRIPTION: SEQ TDN0237: 
TGACTTACAA 

(2) INFORMATION FOR SEQ ID N0238 : 

(0 SEQUENCE CHARACTERISTICS: . 
(A) LENGTH: 10 base pairs 
' (B) TYPE: nuddc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQ ID N0238*. 
GACTTACAAG 

(2) INFORMATION FOR SEQ ID N0239: 

(0 SEQUENCE OIARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudeic acid 
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CO STRANDEDNESS: tingle 
(D) TOPOLOGY; linear 

(iO MOLECULE TYPE: DNA 

(n) SEQUENCE DESCRIPTION: SEQIDN0O39: 

ACTTACAAGG 

(3) INFORMATION FOR SEQ ID NO240: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 toe pate 
(D) TYPE: nucleic add 

(C) STRANDEDNESS: angle 

(D) TOPOLOGY: linear 

(H) MOLECULE TYTC: DNA 
(xf) SEQUENCE DESCRIPTION; SEQ ID N024Ch 
CTTACAAGGC 

(2) INFORMATION FOR SEQ ID N0241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan 

(B) TYPE: nndete acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N004I: 
TTACAAGGCA 

(2) INFORMATION FOR SEQ ID NOS42: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 tee pain 

(B) TYPE: nudetc idd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQ ID N0242: 
TACAAGGCAG 

(2) INFORMATION FOR SEQ ID NQ243: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pass 

(B) TYPE: nudeto acid 

(Q STRANDEDNESS: tmgfc 
(D) TOPOLOGY: Baear 

(n) MOLECULE TYPE: DNA 

(3d) SEQUENCE DESCRIPTION: SEQ IDN0243: 

ACAAGGCACC 



(2) INFORMATION FOR SEQ ID N0244: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 tee pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: singe 
(D) TOPOLOGY: Baear 

00 MOLECULE TYPE: 0NA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N0244: 

CAAOGCAGCT 

(2) INFORMATION FOR SEQ ID KOOAS: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 
{B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Enear 

GO MOLECULE TYPE; DNA 
(X0 SEQUENCE DESCRIPTION: SEQ1DN0245: 
AAGGCAGCTG 

(2) INFORMATION FOR SEQ ID N024& 

0) SEQUENCE CHARACTERISTICS: . 
<A) LENGTH: 10 bate pans 

(B) TYPE: nucleic add 

<C) STRANDEDNESS: atagk 
<D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ IDN0346: 

AGGCAGCTGT 

CD INFORMATION FOR SEQ ID NO£47: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOtecpabs 

(B) TYPE: nucleic add 

CO STRANDEDNESS: tingle 
<D) TOPOLOGY: linear 

(fl) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N0247: 

GGCAGCTCTA 

(2) INFORMATION FOR SEQ ID N024& 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH' 10 base pain 

(B) TYPE: zmdeie add 

(C) STRANDEDNESS: tingie 

(D) TOPOLOGY: linear 
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(U) MOLECULE TYPE* DMA 
(xl) SEQUENCE DESCRIPTION: SEQ ID N0348: 
GCAGCTGTAG 

(2) INFORMATION FOR SEQ ID NOr249; 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pais 

(B) TYPE: nucleic icid 

(Q STRANDEDNESS: sizzle 
(D) TOPOLOGY: linear 

(10 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQIDN0249: 

CAGCTGTAGA 

(2) INFORMATION FOR SEQ ID NO2S0: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE; nttdelc add 

(Q STRANDEDNESS; single 
(D) TOPOLOGY: 



fu) MOLECULE TYPE: DNA 
(xx) SEQUENCE DESCRIPTION: SEQ ID NO450: 
AQCTOTAOAT 

C2) INFORMATION FOR SEQ ID N0351: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
CD) TOPOLOGY: I 



(H) MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO-.251: 
GCTCTAOATC 

(2) INFORMATION FOR SEQ ZD N02S2: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pain 

(B) TYPE: uidete acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

GD MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NOOS2: 
CTCTAOATCT 

(2) INFORMATION FOR SEQ ID NG2S3: 



(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pairs 

(B) TYPE* nucleic add 

(C) STRANDEDNESS: tingle 
P) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0253; 

TGTAGATCTr 

(2) INFORMATION FOR SEQIDN0254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleie acid 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQH>NO:254: 
GTAGATCITA 

9) INFORMATION FOR SEQ ID NO:255: 

0 SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pain 

(B) TYPE: mtdeic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fil) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID N0255: 
TAGATCTTAG 

(2) INFORMATION FOR SEQ ID N0236: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGIH: 10 base pain 
9) TYPE: imnlnin acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Untar 

(H) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID N0256: 
AGATCTTAGC 

(2) INFORMATION FOR SEQ ID N02S7: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: I 

00 MOLECULE TYPE: DNA 
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Cd) SEQUENCE DESCRIPTION: SEQ ID N0257: 
GATUTTAGCC 

(2) INFORMATION FOR SEQ ©NOOSS: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) ^TfiPE Dudcic add 

(Q STRAND EDNESS: single 
<0) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

(*D SEQUENCE DESCRIPTION: SEQ ID N0258: 

ATCTTAGCCA ' 

(2) INFORMATION FOR SEQ ID NOS59: 

Q) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: mtdete acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(3) MOLECULE TYPE; DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0O59: 

TCTTAGCCAC 

(2) INFORMATION FOR SEQ ID NO260: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(G) STRANDEDNESS: tingle 
P) TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 

W) SEQUENCE DESCRIPTION: SEQ IDNQ260: 

CTTAGCCACT 

(2) INFORMATION POT. SEQ ID N0261: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ID MOLECULE TYPE DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID N026 1 : 
TTAGCCACTT 

(2) INFORMATION FOR SEQ ID N026& 

0) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pain 

(B) TYPE nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

fu) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0262: 
TAGCCACTTT 

(Z) INFORMATION FOR SEQ ED N0263: 

(5) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan 

(B) TYPE* nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(3) MOLECULE TYPE BNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0263: 

AOCCAcnrr 

(2) INFORMATION FOR SEQ ID N0264: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: tingle 
(P) TOPOLOGY: 



(li> MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDN0264: 
GCCACmTT 

(2) INFORMATION FOR SEQ ID N0265 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuddc acid 

CO STRANDEDNESS: single 
(D) TOPOLOGY: 



(H) MQLEOTLB TYPE DNA 
(ri) SEQUENCE DESCRIPTION: SEQ ID N0265: 
GCACf I'MTA 

(2) INFORMATI ON FOR SEQ ID N0366: 

0) SEQUENCE CHARACTERISTICS: 
WUSOTH: lObsepaizi 

(B) TYPE nncieic add 

(C) STRANDEDNESS: single 
(P) TOPOLOGY: Bocar 

(H) MOLECULE TYPE DNA 

fed) SEQUENCE DESCRIPTION: SEQIDN0266: 
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CA C1UUA A 

0) INFORMATION FORSEQ ID N0267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: midefo add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xj) SEQUENCE DESCRIPTION: SEQ ID N0067: 
ACTTTTTAAA 

(2) INFORMATION FOR SEQ ID N0268: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: suckle add 

(C) STRANDEDNESS: single 
{D)TOKHjOOY: linear 

(19 MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID N026S : 

C1TMTAAAA 

(2) INFORMATION FOR SEQ ID N0369: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudcic add 

(C) STRANDEDNESS: liable 

(D) TOPOLOGY: linear 

01) MOLECULE TYPE: DNA 
(sJ) SEQUENCE DESCRIPTION: SEQ ID N0269: 
nTTTAAAAG 

(2) INFORMATION FOR SEQ EDNO270: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 
GO TYPE: nucleic add 

(C) STRANDEDNESS: single 
03) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(kQ SEQUENCE DESCRIPTION: SEQ ID NO270: 

TTTTAAAAGA 

(2) INFORMATION FOR SEQ ID N0271: 

(0 SEQUENCE CHARACTERISTICS: 
. (A) LENGTH: 10 bate pain 

(B) TYPE: nudelc add 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0271 : 

TTTAAAAGAA 

a) INFORMATION FOR SEQ ID N0272: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

<xQ SEQUENCE DESCRIPTION: SEQ ID N0272: 

TTAAAAGAAA 

(2) INFORMATION FOR SEQ ID N0273 : 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE; nucleic add 

(C) STRANDEDNESS: singSc 

(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0273: 
TAAAAGAAAA 

(2) INFORMATION FOR SEQ ID NOSH: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuddcadd 

(q STRANDEDNESS: single 
(D) TOPOLOGY: fines 

tf) MOLECULE TYPE: UNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N0274: 

AAAAGAAAAG 

(2) INFORMATION FORSEQ ID N0275: 

(I) SEQUENCE CHARACTERISUCS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<H) MOLECULE TYPE: DNA 
(XQ SEQUENCE DESCRIPTION: SEQ ID NOS75: 
AAAGAAAAGG 
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(2) INFORMATION FOR SEQ ID N0276; 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pain 

(B) TYPE: nucfeto add 

(Q STRANDEDNESS: single 
(P) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

W) SEQUENCE DESCRIPTION: SEQ ID NCh276: 

AAOAAAAQGO 

(2) INFORMATION FOR SEQ ID N0277: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 esse pairs 

(B) TYPE: naddc edd 

(Q STRANDEDNESS: single 
CD) TOPOLOGY; linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0377: 

AQAAAAOGGG 



(2) INFORMATION FOR SEQ ID N027& 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bee pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(XQ SEQUENCE DESCRIPTION: SEQ ID N0378; 
GAAAAGGGGG 



(D) MOLECULE TYPE* DNA 
(d) SEQUENCE DESCRIPTION: SEQ ID NO*280: 
AAAGGGGGGA 

(2) INFORMATION FOR SEQ ID N02S1: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 esse pain 

(B) TYPE: rocktc tdd 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID N0381: 

AAGGGGGQAC 

<2} INFORMATION FOR SEQ ID NMS2: 

fl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N02&; 
AGGGGGGACT 



(2) INFORMATION FOR SEQ ID N02J3; 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObttepdn 

(B) TYPE: nucleic add 

(€) STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID N0279: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOtoepdn 

(B) TYPE: nude ic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGYi linear 

(D) M0LECU12 TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQ IDN0279: 
AAAAGGGGGG 



(2) INFORMATION FOR SEQ ID NOOSO: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENOIR lObasepdrs 

(B) TYP& oadekadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



00 MOLECULE TYPE* DNA 
£ti) SEQUENCE INSCRIPTION: SEQ ID N0283: 
GGGGGGACTG 

CZ) INFORMATION FOR SEQ ID N0284: 

(5) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pain 

(B) TYPE: nsckk add 

(C) STRAN DEDNESS: single 

(D) TOPOLOGY: Bnear 

(B) MOLECULE TYPE: IMA 
W SEQUENCE DISCRETION: SEQ EDN02S4: 
GGCX5GACTGG 

(2) INFORMATION FOR SEQ ID N02S5 : 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudde add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0285: 

OGGGACTGGA 

CD INFORMATION FOR SEQ ID N0286: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pals 

(B) TYPE: oneiric add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(II) MOLECULE TYPE DNA 

<xQ SEQUENCE DESCRIPTION: SEQ ID N0286: 

COQACTGGAA 

Q) INFORMATION FOR SEQ ID N0287: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ IDN0287: 

GQACTGGAAG 

P) INFORMATION FOR SEQ ID N0288: 

(9 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
0)TYPE: nucleic acid 
(Q STRANDEDNESS: single 
03} TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 

(si) SEQUENCE DESCRIPTION: SEQ ID HQ388: 

GACTGGAAGG 

(2) INFORMATION FOR SEQ ID N0389: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 baas pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 MOLECULE TYPE: DNA 



£ri) SEQUENCE DESCRIPTION: SEQ ID N02&9: 
ACTOGAAGGG 

(2) INFORMATION FOR SEQ ID NO390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleic add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

Od) SEQUENCE DESCRIPTION: SEQ ID NO290: 

CTGGAAGGGC 

(2) INFORMATION FOR SEQ ID N0291: 

(5) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(10 MOLECULE TYPE DNA 
(jd) SEQUENCE DESCRIPTION: SEQ fl> NCK291: 
TGGAAGGGCT 

(2) INFORMATION FOR SEQ ID N0292: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pans 
(EQTYPE: nudeic add 
(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

tf) SEQUENCE INSCRIPTION: SEQIDN0.292: * 

GGAAGGGCTA 

(Z) INFORMATION FOR SEQ JDN0293; 

(0 SEQUENCE CHARACTERISTTCS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudde acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID N0093: 
GAAGGGCTAA 

CZ) INFORMATION FOR SEQ ID N0294: 
(Q SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pain 

(B) TYPE: ncdrie add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0294: 

AAGGGCTAAT 

(2) INFORMATION FOR SEQ ID ND295: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pass 
(B) TYPE: nucleic add 
(Q STRANDEDNESS: tingle 
<P) TOPOLOGY: linear 

00 M<H£CULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N029S: 

AOGGCTAATT 

P) INFORMATION FOR SEQ ID N0296: 

fl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic kU 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: tinear 

00 MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID N029& 

GGGCTAATTC 

(2) INFORMATION FOR SBQ ID N0297: 

(p SEQUENCE CHARACTERISTICS: 

(A) LEN(7IH: 10 base pitas 

(B) TYPE nudefc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: I 



(H) MOLECULE TYPE: DNA 
(*i) SEQUENCE DESCRIPTION: SEQ ID N0297: 
GGCEAATTCA 

(2) INFORMATION FORBID N0298: 

CD SEQUB4CE CHARACTERISTICS: 

(A) LENGTH: 10 be* pahs 

(B) TYPE imrinicadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(2) MOLECULE TYPE: UNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID N029& 
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GCTAATTCAC 

C2) INFORMATION FOR SEQ IDNO-299: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: mtdeic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

0D MOLECULE TYPE: DNA 

(id) SEQUENCE DESCRIPTION: SEQ ID NO:299; 

CTAATTCACT 

(2) INFORMATION FOR SEQ ID NOJ00: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pass 

(B) TYPE: nudete acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(u) MOLECULE TYPE: DNA 
(x5) SEQUENCE DESCRIPTION; SEQ ID NO300: 
TAATTCACTC 

(2) INFORMATION FOR SEQ ID NO301: 

® SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pass 
<B) TYPE: mtdeic add 

(Q STRANDEDNESS: rlngk 
(P) TOPOLOGY: Qnear 

(H) MOLECULE TYPE* DNA 

Crd) SEQUENCE DESCRIPTION: SEQ ID ND301: 

AATTCACTOC 

(2) INFORMATION FOR SEQ ID N03G2: 

0) SEQUENCE CHARACTERISTIC: 
CA) LENGTH: 10 base pain 

(B) TYPE: oodeic acid 

<R STRANDEDNESS: single 
<D) TOPOLOGY: Iteear 

(D) MOLECULE TYPE DNA 

M) SEQUENCE DESOUPTTON: SEQ IDNO-JQ2: 

ATTCACTCCC 

(2) INFORMATION FOR SEQ ID NO303: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE* mtdeic add 
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(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(2) MOLECULE TVPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO303: 

TTCACTCCCA 

(2) INFORMATION FOR SEQ ID NO304: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: finear 

MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO304: 
TCACTCCCAA 

(2) INFORMATION FOR SEQ ID NO305: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pah* 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOJ05: 

CACTCCCAAA 

(2) INFORMATION FOR SEQ ID NO30& 

<D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOfewpnn 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singte 

(D) TOPOLOGY: (iseir 

(a) MOLECULE TYPE: DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ IDNO306: 
ACTCCCAAAG 

(Z) INFORMATION FOR SEQ ID NO307: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuckic add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

OQ MOLECULE TYPE: DNA 

en) SEQUENCE DESCRIPTION: SEQ IDNO307: 

CTOCCAAAGA 



C2) INFORMATION FOR SEQ ID NOJ08; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pant 

(B) TYPE: nucldc add 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

Gi) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO308: 

TCCCAAAGAA 

(2) INFORMATION FOR SEQ ID NOJ09: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucldc add 

{Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDNO309: 

CCCAAAGAAG 

(2) INFORMATION FOR SEQ ID NOJIO: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleus add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: iiceir 

(u) MOLECULE TYPE: DNA 

(») SEQUENCE DESCRIPTION: SEQ ID NO310: 

CCAAAOAAGA 

(2) INFORMATION FOR SEQ ID N03 1 1 : 

(Q SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
* (B) TYPE: mtckfe add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(fl) MOLECULE TYPE: DNA . 

Od) SEQUENCE DESCRIPTION: SEQ ID N03U: 

CAAAGAAGAC 

(2) INFORMATION FOR SEQ ID NOJ 12: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYFE: suddcadd 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 
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(5) MOLECULE TYPE: DNA 
(xJ) SEQUENCE DESCRIPTION: SEQ ID NOJ12; 
AAAGAAGACA 

(2) INFORMATION FOR SEQ ID NOSI3: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: middc add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N03U: 

AAGAAOACAA 

(2) INFORMATION FOR SEQ ID N0314: 

(J) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
(6) TYPE: nudde tcld 
(Q STRANDEDNESS: tingle 

(D) TOPOLOGY: Hnetr 

(H) MOLECULE TYPE' DNA 
On) SEQUENCE DESCRIPTION: SEQ ID N03I4: 
AGAAGACAAG 

(2) INFORMATION FOR SEQ ID N0315: 

(I> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



<E) MOLECULE TYPE: DNA 
M) SEQUENCE DESCRIPTION: SEQ ID N0315: 
GAAOACAA0A 

(2) INFORMATION FOR SEQ ID N0316: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleic icid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(B) MQLBCUIE TYPE: DNA 

(4) SEQUENCE DESCRIPTION: SEQ ID N03J5: 

AAGACAAGAT 

(2) INFORMATION FOR SEQ ID NO"J 17: 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
•(B) TYPE: nucleic tcld 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

©MOLECULE TYPE* DNA 

<ri) SEQUENCE DESCRIPTION: SEQ ID N0317: 

AOACAAGATA 

(2) INFORMATION FOR SEQ ID NO -3 18: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 fane pain 

(B) TYPE DBdeic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE DNA 
{x0 SEQUENCE DESCRIPTION: SEQ ID NOJ18: 
OACAAGATAT 

(2) INFORMATION FOR SEQ ID NO J 19: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pahs 

(B) TYPE racJek Kid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<5) MOLECULE TYPE: DNA 

<w) SEQUENCE DESCRIPTION: SEQ ID NO* 19: 

ACAAGATATC 

(2) INFORMATION FOR SEQ ID NO320: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nucleic add 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE DNA 
On) SEQUENCE DESCRIPTION: SEQ ID NO-J20: 
CAAGATATCC 

(2) INFORMATION FOR SEQ ID N0521: 

CO SEQUDICE CHARAC1LK1SUCS: 

(A) LENGTH: 10 base pais 

(B) TYPE: totcfete add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 

00 MOLECULE TYPE DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0321: 
AAGATATCCT 

(2) INFORMATION FOR SEQ ID N0522: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE; nudeic add 

(O STRANDEDNESS: single 
(D) TOPOLOGY; finear 

Cu) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQZDN0322: 

AGATATCCTT 

(2) INFORMATION FOR SEQ ID N0323: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: Buckie add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



fu) MOLECULE TYPE: DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID N0323: 
GATATCCTTG 

(2) INFORMATION FOR SEQ ID ND324: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: miciefe acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

01) MOLECULE TYPE: DNA 

<xl) SEQUENCE DESCRIPTION: SEQ TDN0324: 

ATATCCTTGA 

a) INFORMATION FOR SEQ ID N032S : 

(9 SEQUENCE CHARACTERISTICS: 
(A) LENGTH* 10 base pairs 
(BO TYPE: nucleic add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0325: 

TATOCTTGAT 

O) INFORMATION FOR SEQ ID NO-J26: 

© SEQUENCE CHARACIERISTTCS: 



(A) LENGTH: 10 base pain 

(B) TYPE: ondek add 

(Q STRANDEDNESS: tingle 
P) TOPOLOGY: linear 

<fl) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID N0326: 

ATCCTTGATC 

C2) INFORMATION FOR SEQ ID N0327 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nncWe add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0327: 

TCCTTGATCT 

(2) INFORMATION FOR SEQ ID N032& 

© SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
9) TYPE: nodeieadd 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

M) SEQUENCE DESCRIPTION: SEQ ID N0328: 

CCTTGATCTO 

(2) INFORMATION FOR SEQ ID N0329: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGFIH: 10 base pass 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
£tQ SEQUENCE DESCRIPTION: SEQ ID N032£ 
CITGATCTGT 

(2) INFORMATION FOR SEQ ID NO330: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: middoaeid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: finear 

(u) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO330: 
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TTGATCTGTG 

(2) INFORMATION FOR SEQ ID N0J31; 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pass 

(B) TYPE: micfehicid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

(H) MOLECULE TYPE: DNA 
(3d) SEQUENCE DESCRIPTION: SEQXDN0331: 
TOATCTGTGG 

(2) INFORMATION FOR SEQ ID NOS32: 

. (D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

<Q STRANDEDNESS: single 
<D) TOPOLOGY:] 



(B) MOLECULE TYPE: DNA 
(ti) SEQUENCE DESCRIPTION: SEQ ID NO*J32: 
OATCTOTOOA 

(2) INFORMATION FOR SEQ ID ND333: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pins 

(B) TYPE: lmdefe tdd 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DMA 

flu) SEQUENCE DESCRIPTION: SEQ IDN0333: 

ATCTGTCGAT 

(2) INFORMATION FOR SEQ ID NO-.334: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 hue piss 

(B) TYPE: nudefcadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Battr 

(fi) MOLECULE TYPE: DNA 
{xQ SEQUENCE DESCRIPTION: SBQ ID N0334: 
TCTT7IGGATC 

(2) INFORMATION FOR 5EQ ID K0335: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObaeptbs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: fiaear 

(ii) MOLECULE TYPE: DNA 
<xl) SEQUENCE DESCRIPTION: SEQ ID NOJ35: 
CTGTGGATCT 

(2) INFORMATION FOR SEQ ID N0336; 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nadexe acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fines- 

(D) MOLECULE TYPE DNA 
(n) SEQUENCE DESCRIPTION: SEQ fl> NOS36: 
TGTGGATCTA 

(2) INFORMATION FOR SEQ ID N0337: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: mtcklc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SBQ ID NO:337: 
GTGGATCTAC 

(2) INFORMATION FOR SEQ IDNOJ38: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base palm 
(BE) TYPE: nucleic add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(jri) SEQUENCE DESCRIPTION: SEQ ID N033 fc 

TGGATCTACC 

(2) INFORMATION FOR SEQ ID N0339: 

(9 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base patas 
• (B) TYPE: nu c fri e add 
(Q STRANDEDNESS: singes 
TO TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xD SEQUENCE DESCRIPnON: SEQ IDN0339: 

GGATCTACCA 
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(2) INFORMATION FOR SEQ ID NO340: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: mideic idd 

(Q STRANDEDNESS; tingle 
(D) TOPOLOGY: linear 

CO MOLECULE TOE; DMA 

Ori) SEQUENCE DESCRIPTION: SEQ ID NOiMO: 

QATCTACCAC 

(2) INFORMATION FOR SEQ ID NQ041: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 but pairj 

(B) TYFE: nucleic acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY; Hdcit 

(li) MOLECULE TYPE; DNA 
fxO SEQUENCE DESCRIPTION: SEQ ID N0341: 
ATCTACCACA 

C2) INFORMATION FOR SEQ ID NO:342: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 

fri) SEQUENCE DESCRIPTION; SEQ ID N0342: 

TCTACCACAC 

0) INFORMATION FOR SEQ ID N0343 : 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 have pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0343: 

CTAOCACACA 

(Z) INFORMATION FOR SEQ ID N0 J44: 

0) SEQUENCE CHARACTERISTICS: 

(A) Want lObasepabi 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Linear 



117- 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ 1DN0344: 
TACCACACAC 

(2) INFORMATION FOR SEQ ID N0345: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepafav 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO-J43: 
ACCACACACA 

(2) INFORMATION FOR SEQ ID N0346: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

<xl) SEQUENCE DESCRIPTION: SEQ IDN034& 

CCACACACAA 

(2) INFORMATION FOR SEQ ID N0347: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

<C) STRANDEDNESS: tingle 
(D) TOPOLOGY: Itoear 

(II) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SBQ ID N0347: 
CACACACAAG 

(2) INFORMATION FOR SEQ ID N034S: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENCmi 10 base pain 

(B) TYPE: nudeio add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: Qnear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0348: 
ACACACAAGG 

(2) INFORMATION FOR SEQ ID N0349: 
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(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH lObase pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(10 MOLECULE TYPE DNA 

(n) SEQUENCE DESCRIPTION: SEQ IDN0349: 

CACACAAGGC 

(2) INFORMATION FDR SEQ ID NOJ50: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENCTIH: 10 bee pain 

(B) TYPE: nuddo add 

(Q STRANDEDNESS; single 
(D) TOPOLOGY: 1 



(Q) MOLECULE TYPE: DNA 
0(1) SEQUENCE DESCRIPTION: SEQ ID NOJ50: 
ACACAAOGCT 

(2) INFORMATION FOR SEQ ID NQ551: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 b« pairs 

(B) TYPE; nucleic add 
(P) STRANDEDNESS: 
(D) TOPOLOGY: 



01) MOLECULE TYPE: DMA 
C*J) SEQUENCE DESCRIPTION: SEQ IDNO-J5I: 
CACAAOOCTA 

(2) INFORMATION FOR SEQ TD N0552: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pain 

(B) TYPE: nndcJe tcid 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: fifett 

(3) MOLECULE TYPE: DNA 

<x|) SEQUENCE DESCRIPTION: SEQ IDN03S2: 

ACAAGGCTAC 

p) INFORMATION FOR SEQ ID NOJ53 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: roddc «dd 

(Q STRANDEDNESS: xtogle 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 



(xi) SEQUENCE INSCRIPTION: SEQ ID N05S3: 
CAAGGCTACT 

(2) INFORMATION FOR SEQ ID N0354: 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: ncdezc acid 

(Q STRANDEDNESS: txagfe 
(D) TOPOLOGY: Hnear 

(9) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION; SEQ IDN0354: 

AAGGCTACTT 

(2) INFORMATION FOR SEQ ID NOJ55: 

(0 SEQUENCE CHARACTERBTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: imdde add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ 1DN0355: 
AGGCTACTTC 

(2) INFORMATION FOR SEQ ID NO X 5& 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pah* 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

CD MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN035& 

GGCTACTTCC 

(2) INFORMATION FOR SEQ ID NOSS7: 

©SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pate 

(B) TYPE: andtic add 

(C) STRANDEDNESS: ifagfc 

(D) TOPOLOGY: 



(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNOJ57: 
GCTACTTCCC 

C2) INFORMATION FOR SEQ ID N035& 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(5) MOLECULE TYPE; DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID N03 38: 

CTACTTCCCT 

(2) INFORMATION FOR SEQ ID N0339: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan 

(B) TYPE: nuckio add 

(Q STRANDEDNESS: stogie 
(D) TOPOLOGY: Hneir 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNOJ59: 

TACTTCCCTG 

(2) INFORMATION FOR SEQ ID N036& 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObuepaix* 

(B) TYPE: nocksc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: lbetr 

(D) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQIDNO360: 

ACTTCCCTGA 

C2) INFORMATION FOR SEQ ID N0361: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Unctr 

(0) MOLECULE TYPE: DMA 

(2d) SEQUENCE DESCRIPTION: SEQ IDN0361: 

CTTCCCTGAT 

CZ) INFORMATION FOR SEQ ID N0362: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LBtGFXH: lObsupaizs 
(D) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Linear 

00 MOLECULE TYPE: DNA 

(id) SEQUENCE DESCRIPTION: SEQ ID N036& 
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TTCCCTGATT 

(2) INFORMATION FOR SEQ ID N0363: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:363: 

TCCCKjATTG 

(2) INFORMATION FOR SEQ ID NO J64 : 

0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pain 

(B) TYPE: nudcic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: Iineir 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0364: 

CCCTOATTGG 

(2) INFORMATION FOR SEQ ID NQ365: . 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudcic add 

(O STRANDEDNESS: single 
p) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(jri) SEQUENCE DESCRIPTION: SEQ ID N0365: 
CCTGATTOGC 

CZ) INFORMATION FOR SEQ ID N0366; 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(I) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0366: 
CTQATTGOCA 

(2) INFORMATION FOR SEQ ID N0367: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudcic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(3) MOLECULE TYPE: DNA 
W> SEQUENCE DESCRIPTION: SEQ IDN0367: 
TQATTGGCAO 

(2) INFORMATION FOR SEQ ID NOJ68: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 hue pahs 
<B)TVTE: nnctefoaoid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: 



(2) INFORMATION FOR SEQ ID NG372: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pans 

(B) TYPE; nocleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 1 



(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO-36S: 
GATTGGCAGA 

(2) INFORMATION FOR SEQ ID N0369: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: mtdcte add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(a) MOLECULE TYPE: DNA 
£3) SEQUENCE DESCRIPTION: SEQ ID NO-J69: 
ATTGGCAGAA 

(2) INFORMATION FOR SEQ ID NO370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 tee pairs 

(B) TYPE* midd c acid 

(C) STRANDEDNESS: single 
(P) TOPOLOGY: Unear 

<H) MOLECULE TYPE: DNA 

C*D SEQUENCE DESCRIPTION: SEQ ID NO370: 

TTGGCAGAAC 

(2) INFORMATION FOR SEQ IDN0371: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nodes; sad 

CQ STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 
W SEQUENCE DESCRIPTION: SEQ IDN0371: 
TCGCAQAACT 



©MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID N0372: 
GGCAGAACTA 

(2) INFORMATION FOR SEQ ID N0373: 

© SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 bate pain 
<B) TYPE: sadeis&cirf 
(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<H) MOLECULE TYPE: DNA 

£d) SEQUENCE DESCRIPTION: SEQ ID NQ373: 

GCAGAACTAC 

(2) INFORMATION FOR SEQ IDNOS74: 

<D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pairs 

(B) TYPE: nudele add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(G) MOLECULE TYPE: DNA 

(») SEQUENCE DESCRIPTION: SEQIDNO-J74: 

CAGAACTACA 

(2) INFORMATION FOR SEQ ID N0375: 

(t) SEQUENCE CHARACTERISTICS: 
(A)UNGIH: ICbiscpans 
. (B) TYPE: nucleic add 
(Q STRANDEDNESS: single 
CP) TOPOLOGY: faesr 

09 MOLECULE TYPE: UNA 

0d) SEQUENCE DESCRIPTION: SEQ ID N0375: 

AGAACTACAC 

(2) INFORMATION FOR SEQ ID N037& 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: nucleic add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: 
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(U) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0376: 
GAACTACACA 

(2) INFORMATION FOR SEQ ID N0377: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pabs 

(B) TYPE: midde add 

(C) STRANDEDNESS: f ingle 

(D) TOPOUXjY: Eoear 

(il) MOLECULE TYPE: DNA 
(xt) SEQUENCE DESCRIPTION: SEQ ID N0577: 
AACTACACAC 

CO INFORMATION FOR SEQ ID NO 378; 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: finear 

MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0378: 
ACTACACACC 

(2) INFORMATION FOR SEQIDNOJ79: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pairs 

(B) TYPE: mtdelc add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N037* 

CTACACACCA 

(2) INFORMATION FOR SEQ ID N03 80: 

Q) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: noddc add 

(Q STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NO380: 

TACACACCAO 

(2) INFORMATION FOR SEQ ID N0381: 



(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(P) TOPOLOGY: linear 

(10 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0381: 

ACACACCAGG 

(2) INFORMATION FOR SEQ ID NO:382: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pabs 

(B) TYPE nucleic add 

(Q STRANDEDNESS: stngte 
(D) TOPOLOGY: Linear 

00 MOLECULE TYPE: WA 

(x9 SEQUENCE DESCRIPTION: SEQ DDNO:382: 

CACACCAGGG 

9) INFORMATION FOR SEQ IDNO:383: 

(]) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(I!) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ CD N0383: 

ACACCAOGGC 

O) INFORMATION FOR SEQ ID N0384: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pan 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: stogie 

(D) TOPOLOGY: Bnear 

(5) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID N0384: 
CACCA0OGCC 

(2) INFORMATION FOR ffiQ ID NOJ85: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: node* acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: 

(H) MOLECULE TYPE: DNA 
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fci) SEQUENCE DESCRIPTION: SEQ ID N0385: 
ACCAGGGCCA 

a) INFORMATION FOR SEQ ID N0386: 

ffl SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pain 

(B) TYPE: roddc add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY:! 



(H) MOLECULE TYPE: DNA 
<*0 SEQUENCE DESCRIPTION: SEQ ID N03S6: 
CCAGGGCCAG 

(2) INFORMATION FOR SEQ ID N03S7: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bee puts 

(B) TYPE: naddc icld 

(Q STRANDEDNESS: liogle 
(D) TOPOLOGY: 



09 MOLECULE TYPE: DNA 
(id) SEQUENCE DESCRIPTION: SEQ IDNO-J87: 
CAGGGCCAGG 

(2) INFORMATION FOR SEQ ID N0388: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: node k add 

CP STRANDEDNESS: single 
(D) TOPOLOGY: fisear 

ffi) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS8& 

AGGGOCAOGG 

(2) INFORMATION FOR SEQ ID NO509: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 btse pus 

(B) TYPE: middeadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Linear 

(H) MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQ IDNO-JS9: 
GGQOCAGGGG 

(2) INFORMATION FOR SEQ IDNO590: 

© SEQUENCE CHARACTERISTICS: 



(A) LENGTH: lObsseptbi 

(B) TYPE: ooddc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO390: 
GGCCAGGGGT 

(2) INFORMATION FOR SEQ ID N0591: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base purs 

(B) TYPE: mideic add 
(Q STRANDEDNESS: 
(D) TOPOLOGY: 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0591: 
GOCAGGGGfIC 

(2) INFORMATION FOR SEQ ID N0392: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bast pain 

(B) TYPE: mideic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0392: 

CCAGGGGTCA 

(2) INFORMATION FOR SEQ ID N0393: 

(!) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 twe pairs 

(B) TYPE: nnri etc add 

(C) STRANDEDNESS: ilnglt 
CD) TOPOLOGY: I 



(U) MOLECULE TYPE: DNA 
Co) SEQUENCE DESCRIPTION: SBQIDNO-393: 
CAGOGGTCAG 

Q) INFORMATION FOR SEQ ID N0394: 

CD SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 

(B) TYP£: naddc add 

(C) STRANDEDNESS: dngte 
<P) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0594: 
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AGGGOTCAOA 

(2) INFORMATION FOR SEQ ID N0395: 

(!) SEQUENCE CHARACTERISTICS r 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Oi) MOLECULE TYPE; DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0393: 

GGGGTCAGAT 

(2) INFORMATION FOR SEQ ID N039& 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nucleic add 

(Q STRANDEDNESS: single ' 
(D)TOraLOOY:liBear 

m MOLECULE TYPE: DNA 

0$ SEQUENCE DESCRIPTION: SEQ IDN0396: 

GGGTCAGATA 

(2) INFORMATION FOR SEQ ID N0397: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDWW97: 
GGTCA0ATAT 

(2) INFORMATION FOR SEQ ID N039& 

© SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 ba* pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Hrjear 

(II) MOLECULE TYPE DNA 
ft) SEQUENCE DESCRIPTION: SEQ IDN0398: 
CTCAGATATC 

(2) INFORMATION FOR SEQ IDNO-J99: 

(i) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 bate pairs 

(B) TYPE: rmdete acid 



(Q STRANDEDNESS: single 
P) TOPOLOGY: linear 

(19 MOLECULE TYPE: DNA 

(ti) SEQUENCE DESCRIPTION: SEQ ID N039* 

TCAGATATCC 

(2) INFORMATION FOR SEQ ID NO400: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nudefa add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(5) MOLECULE TYPE DNA 

Ori) SEQUENCE DESCRIPTION: SEQ IDNOjWO: 

CAGATATCCA 

(2) INFORMATION FOR SEQ ID NCM01: 

0) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base ptha 
(B) TYPE nucleic add 
(Q STRANDEDNESS: singe 
(D) TOPOLOGY: linear 

(D) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:401: 

AGATATCCAC 

(2) INFORMATION FOR SEQ ID NO*02: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base paia 

(B) TYPE nndelc add 

(C) STRANDEDNESS: single 
0)) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4C2: 

GATATOCACT 

(2) INFORMATION FOR SEQ ID NO:403: 

0) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic add 

(JO STRANDEDNESS: single 

(D) TOPOLOGY: Knew 

(H) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403: 
ATATCCACTG 
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(2) INFORMATION FOR SEQIDNO:404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nodoc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fri) MOLECULE TYPE: DMA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:404: 

TATCCACTQA 

(2) INFORMATION FOR SEQ ID NO* 05: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mcferc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOr405: 

ATCCACTQAC 

(2) INFORMATION FOR SEQ ID NO*06: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

<C) STRANDEDNESS: single 
<D> TOPOLOGY: 



(fl) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N04D6: 
TCCACTOACC 

(2) INFORMATION FOR SEQ ID NO:407: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pais 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

ffi) MOLECULE TYPE: DNA 

(x© SEQUENCE DESCRIPTION: SEQIDNO:407: 

CCACTGACCT 

(2) INFORMATION FOR SEQ IDNOtfQE: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bass pain 

(B) TYPE: anddc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(il) MOLECULE TYPE* DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:408: 
CACTQACCIT 

(2) INFORMATION FOR SEQ ID NO:409: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(fl) MOLECULE TYPE DNA 
(xQ SEQUENCE DESCRIPTION: SEQ IDNO:409: 
ACTGACCTTT 

(2) INFORMATION FOR SEQ ID NOtflO: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pdrs 

(B) TYPE* oudeJc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



(19 MOLECULE TYPE DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:410: 
CTGACCTTTG 

(2) INFORMATION FOR SEQ ID NOrfl 1: 

(0 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pairs 
(B) TYPE nucleic add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE DNA 

$d) SEQUENCE DESCRIPTION: SEQ IDNO-411: . 

TGACCTTTGO 

(2) INFORMATION FOR SEQ ID NO:412: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* imdds add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:412: 
GACCTTTGGA 

O) INFORMATION FOR SEQ ID NO*J3: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudeic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413: 

ACCTTTGGAT 

(2) INFORMATION FOR SEQ ID NO:414: 

(I) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 bow pain 
(B) TYPE nucleic acid 
(Q STRANDEDNESS: unfile 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0414: 

CCnTCQATG 

C2) INFORMATION FOR SEQ ID NCM15: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: naskfc add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: Enenr 

00 MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NCM15: 

CITTGGATGO 

C2) INFORMATION FOR SEQ ID N0416: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LB4GTH: 10 base pain 

(B) TYPE miefeSe add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

01) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDN0416 
TnGGATGGT 

(2) INFORMATION FOR SEQ ID NO:4I7: 

fl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObaseptizs 

(B) TYPE nudeic add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: 1 

. (il) MOLECULE TYPE DNA 



(xi) SEQUENCE DESCRIPTION: SEQ IDNO.417: 
TTXK5ATGGTC 

<2) INFORMATION FOR SEQ ID NO:418; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pain 

(B) TYPE nudeic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE DNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID N&4I8: 
TGGATGGTGC 

0) INFORMATION FOR SEQ ID NO:41* 

©SEQUENCE CHARACTERBTIICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nuddc add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: Boear 

(U) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4l9: 
GGATCGTOCT 

(2) INFORMATION FOR SEQ ID NO:420: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepafat 

(B) TYPE nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: Boear 

(U) MOLECULE TYPE DNA 

(xl) SEQUENCE DESCRIPTION: SEQIDNO:420: 

GATGGTGCTA 

(Z) INFORMATION FOR SEQ ID NO:42l: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 owe pairs 

(B) TYFE nockic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE DNA 

. (xt) SEQUENCE DESCRIPTION: SEQ ID NCk42l: 

ATGGTGCTAC 

(2) INFORMATION FOR SEQ ID NOC2: 
(Q SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pain 
<B) TYPE: rmcfcto acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
M SEQUENCE DESCRIPTION: SEQ ID N0422: 
TGGTGCTACA 

Q) INFORMATION FOR SEQ ID NO:423: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: iOtoe pans 

(B) TYPE: cmdelc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Kncar 

(ii) MOLECULE TYPE; DNA 
(to) SEQUENCE DESOUFTTON: SEQ ID NO:423: 
GGTGCTACAA 

(2) INFORMATION FOR SEQ ID N0424: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepalxs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(Xi) SEQUENCE DESCRIPTTON: SEQ ID NO:424: 
GTGCTACAAG 

(2) INFORMATION FOR SEQ ID N0425: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: rmcJefc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bncar 

(B) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQIDNO:425: 
TGCTACAAGC 

(2) INFORMATION FOR SEQ IDN042& 

0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE racfcte add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: Bncar 

® MOLECULE TYPE: DNA 

(xi) SBQUBJCE DESCRIPTION: SEQ ID NO**: 
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GCTACAAGCT 

(2) INFORMATION FOR SEQ ID NO:427: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 cue pairs 

(B) TYPE: raideic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(fi) M<H£CULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:427: 
CTACAAGCTA 

(2) INFORMATION FOR SEQ ID NO:428: 

©SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudric add 

(C) STRAN DEDNESS: tingle 

(D) TOPOLOGY: linear 

fri) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO*2& 
TACAAGCTAG 

(2) INFORMATION FOR SEQ ID NOX29: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: ttudekadd 

(C) STRANDEDNESS: aingle 

(D) TOPOLOGY: fincar 

(B) MOLECULE TYPE: DNA 
(d) SEQUENCE DESCRIPTION: SEQ IDNO*29: 
ACAAGCEAGT 

(2) INFORMATION FOR SEQ ID NO:430: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE rmdeic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

Of) MOLECULE TYPE- DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430: 
CAAGCTAGTA 

O) INFORMATION FORSBQ ID NO:43 1 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: tmdeic add 
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(C) STRANDEDNESS: tingle 
P) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NOs431: 

AAGCTAOTAC 

(2) INFORMATION FOR SEQ IDN0.432: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 baa: pairs 

(B) TYPE: nuclric add 

(Q STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432: 
AGCTAGTACC 

(2) INFORMATION FOR SEQ ID NO*33: 

(0 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 
(B) TYPE: nucleic acid 
(O STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

Cri) SEQUENCE DESCRIPTION: SEQ ID NO:433: 

GCTA0TACCA 

(2) INFORMATION FOR SEQ ID NO:434: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pais 

(B) TYPE: nudcic acid 

<C) SIRANDEDNESS: itngte 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:434: 

CTAGTACCAG 

C3) INFORMATION FOR SEQ ID NO*3S: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pain 

(B) TYPE: nucleic acU 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(D) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:435: 
TAGTACCAGT 



(2) INFORMATION FOR SEQ ID NO:436: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(€) STRANDEDNESS: tingle 
(0) TOPOLOGY: linear 

<U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO*36: 

AGTACCAGTT 

0) INFORMATION FOR SEQ ID NO:437: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Bsear 

00 MOLECULE TYPE: DNA 

0d) SEQUENCE DESCRIPTION: SEQ ID NO*37: 

GTACCAGTTG 

INFORMATION FOR SEQ ID NO:438: 

0) SEQUENCE- CHARACTERISTICS: 

(A) LENGTH: 10 base pass 

(B) TYPE: nndeic add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: I 



(u) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:438: 
TACCAGTTGA 

(2) INFORMATION FOR SEQ ID N0439: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 hue p tin 

(B) TYPE: nucleic acid 

(C) STRANDH2NESS: single 
<D) TOPOLOGY: linear 

(U) MOLECULE TYPE DNA 

Cd) SEQUENCE DESCRIPTION: SEQ ID NO:439: 

ACCAGTTGAG 

(2) INFORMATION FOR SEQ ID NO440: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucfclc add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: I 
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(H) MOLECULE TYPE: DNA 
(w) SEQUENCE DESCRIPTION: SEQ IDNO:440: 
OCAGTTGAGC 

(2) INFORMATION FOR SEQ ID NOrf41: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepeirs 

(B) TYPE: mtsle ic arid 

(C) STRANDEDNESS: i Ingle 

(D) TOPOLOGY: Hncsr 

(U) MOLECULE TYPE: DNA 
(jrf) SEQUENCE DESCRIPTION: SEQ ID NO:44L 
CACTTGAGCC 

(2) INFORMATION FOR SEQ ID NO:442: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nocSeicadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bncar 

(5) MOLECULE TYPE: DNA 
(xJ) SEQUENCE DESCRIPTION: SEQIDNO:442: 
AGTTGAGCCA 

(Z) INFORMATION FOR SEQ ID N0443: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 bate pain 
(BO TYPE: nucleic add 
(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(») SEQUENCE DESCRIPTION: SEQ ID NO:443 : 

QTTGAGOCAO 

(2) INFORMATION FOR SEQ ID NCM44: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pair* 

(B) TYFE: nodefcasid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

<E) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNQ444: 

TTQAGCCAGA 

(2) INFORMATION FOR SEQ ID N0445: 



0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: irnddc arid 

(C) STRANDH>NESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(m) SEQUENCE DESCRIPTION: SEQ ID NO:445: 
TQAGCCAGAT 

(2) INFORMATION FOR SEQ ID NO:446: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base paid 

(B) TYPE: codelc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<H) MOLECULE TYPE* DNA 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO*4& 

GAGCCAOATA 

(2) INFORMATION FOR SEQ ID NO*47: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObiscpiirx 

(B) TYPE: nnddc acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ m NO:447: 
AGCCAGATAA 

(2) INFORMATION FOR SEQ ID N0^48: 

CO SEQUENCE CHARACTHUSTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nnddc add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NOtf4& 

GOCAGATAAG 

(2) INFORMATION FOR SEQ ID NO-j44* 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nnddc acid 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: 

fffi MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SBQ IDN0449: 
CCAGATAAGG 

(2) INFORMATION FOR SEQ ID NO:45(h 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic add 

(C) STRANDEDNESS: single 
P) TOPOLOGY: line* 

(u) MOLECULE TYPE: UNA 

(3d) SEQUENCE DESCRIPTION: SEQ IDNO:450: 

CAGATAAOCT 

(2) INFORMATION FOR SEQ ID NOrtSl: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic tcid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

fu) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO:451; 
AOATAAGGTA 

(2) INFORMATION FOR SEQ ID NO:452: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bass pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
CD) TOPCHjOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452: 

GATAAGGTAG 

CO INFORMATION FOR SEQ ID NO:453: 

0) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 buc pitas 
(B)TYPE: nucleic add 
(Q STRANDEDNESS: tingle 
<D) TOPOLOGY; Unear 

(U) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ IDN0453: 

ATAAGGTAGA 

(2) INFORMATION FOR SEQ IDNO:4S4: 

(0 SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pain 

(B) TYPE- nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(id) SEQUENCE DESCRIPTION: SEQ ID NO:454: 

TAAGGTAGAA 

(2) INFORMATION FOR SEQ ID NO:455: 

0) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base paint 

(B) TYPE: mwldc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unear 

ffi) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO:45S: 
AAGGTAGAAG 

(2) INFORMATION FOR SEQ ID NO:456: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
{B) TYPE: mideic acid 
(O STRANDEDNESS: single 
0>) TOPOLOGY: finesr 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:456: 

AGGTAGAAGA 

(2) INFORMATION FOR SEQ ID N0457: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudeio add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(5) MOLfeCULE TYPE: DNA 
(aJ) SEQUENCE DESCRIPTION: SEQ ID NO:457: 
GGTAGAAGAG 

(2) INFORMATION FOR SEQ ID NO:4S8: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE* nudeie add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:458: 
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GTAGAAGAGG 

(2) INFORMATION FOR SEQ D>NO:459: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pate 

(B) TYPE: nucleic add 

CQ STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE; UNA 

feO SEQUENCE 0ESCRIP17QN; SEQ ID NO-.459: 

TAGAAGAOGC 

(Z) INFORMATION FOR SEQ ID N&460: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nodefc add 

. (QSTRANDEDNESS: single 
<P) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

m SEQUENCE DESCRIPTION: SEQ ID NO*60: 

AGAA0AGGCC 

(2) INFORMATION FOR SEQ ID NO:46l: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pah? 

(B) TYPE: nodeic add 

(Q STRANDEDNESS: tingle 
(P) TOPOLOGY: 



02) MOLECULE TYPE: DNA 
<X0 SEQUENCE DESCRIPTION: SEQ ID NO** I: 
OAAGAGGOCA 

(2) INFORMATION FOR SEQ ZD NO*£2: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IObssepafe 

(B) TYPE: nudefcadd 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: fincar 

(n) MOLECULE TYPE: DNA 

W) SEQUENCE DESCRIPTION; SEQ ID KCM62: 

AAGAGGCCAA 

(2) INFORMATION FOR SEQ ID NOrf63: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nndeic add 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

00 MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SBQ IDNO:463: 
AOAGGCCAAT 

(2) INFORMATION FOR SEQ ID NO:464: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH* 10 base pain 

(B) TYRE: nude© add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



00 MOLECULE TYPE DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO:464: 
GAGGCCAATA 

(2) INFORMATION FOR SEQ ID NOrftt: 

(0 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pah 
CD) TYPE: nucleic add 
(O STRANDHJNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:465: 

AGGCCAATAA 

(2) INFORMATION FOR SEQ ID NO:466: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: gndei c add 
(QSTRANDEDNESS: single 
P)TOPOLOOY: linear 

00 MOLECULE TYPE: DNA 

W) SEQUENCE DESCRIPTION: SEQ ID NO:46& 

GGCCAATAAA 

CO INFORMATION FOR SEQ ID NO>W7: 

0) SEQUENCE CHARACTERISTICS: 
(A)LENGIH: 10 base pain 
CB) TYKE: puulritt acid 
(QSTRANDEDNESS: single 
(D) TOPOLOGY: 



(H) MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQIDNO*67: 
GCCAATAAAG 
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(2) INFORMATION FOR SEQ ID NO:46S: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: micJdo told 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 

fri) SEQUENCE DESCRIPTION: SEQ 10 NO.461: 

CCAATAAAGO 

(2) INFORMATION FOR SEQ ID N04 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pun 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

QO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO*69: 

CAATAAAGGA 

(2) INFORMATION FOR SEQ ID NO470: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nodefc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(S) MOLECULE TYPE DNA 

0u) SEQUENCE DESCRIPTION: SEQ ID NO*70: 

AATAAAGGAG 

a) INFORMATION FOR SEQ ID N&471: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LDZGTH: 10 base pan 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ti) MOLECULE TYPE: DNA 
ft) SEQUENCE DESCRIPTION: SSQ ID N047I: 
ATAAAGGAOA 

(2) INFORMATION FOR SEQ ID N0472: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
(B> TYPE: nudeto add 
(Q STRANDEDNESS: single - 
(D) TOPOLOGY: ] 
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00 MOLECULE TYPE: DNA 
Ori) SEQUENCE DESCRIPTION: SEQ ID N0472: 
TAAAGGAGAG 

(2) INFORMATION FOR SEQ ID NO*73: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pun 

(B) TYPE: mideie acid 

<Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0473: 

AAAGGAGAGA 

(Z) INFORMATION FOR SEQ ID N0474: 

Q) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYFE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO:474: 

AAGGAGAOAA 

(Z) INFORMATION FOR SEQ ID NO;475: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pairs 

(B) TYPE: nucleic add* 

(O STRANDEDNESS: single 
<p) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:475: 

AGGAOAOAAC 

(2) INFORMATION FOR SEQ ID NO>476: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(jri) SEQUENCE DESCRIPTION: SEQ ID NO:476: 

GGAGAGAACA 

(Z) INFORMATION FOR SEQ ID N0477: 
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(0 SEQUENCE CHARACTERISTICS; 

(A) LENGIH: 10 base pairs 

(B) TYPE: uuctete tcld 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ID MOLECULE TYPE: DNA 

M> SEQUENCE DESCRIPTION: SEQ ID NO:477: 

GAGAGAACAC 

<2) INFORMATION FOR SEQ E> NCW78: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH; SO bee purs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: (ingle 
(D) TOPOLOGY:! 



00 MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:478: 
AQAGAACACC 

(2) INFORMATION FOR SEQ ID NOi479: 

<5) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic Kid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Itoear 

Co) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ IDN0479: 
GAGAACACCA 

(2) INFORMATION FOR SEQ ID NO480: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
CB) TYPE: rmcfefc acid 
(Q STRANDEDNESS: itagfc 
(D)TOP03jOGY: 



(H) MOLECULE TYPE: DNA 
M SEQUENCE DESCRIPTION: SEQ IDNCfrttO: 
AGAACACCAG 

(2) INFORMATION FOR SEQ ED NO*81: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: mtdefcadd 

<C) STRANDEDNESS: stogie 
(D) TOPOLOGY: fiacsr 

® MOLECULE TYPE: DNA 



W) SEQUENCE DESCRIPTION: SEQ ID NO:48I: 
GAACACCAGC 

(2) INFORMATION FOR SEQ ID NOrf82: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:482: 
AACACCAjOCT 

(2) INFORMATION FOR SEQ ID N&4&: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO*83: 
ACACCAGCTT 

(2) INFORMATION FOR SEQ ID NO:484: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nscfeicidd 

(C) STRANDEDNESS: shftje 
<P) TOPOLOGY: Gncar 

(u) MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SEQ ID NO:484: 

CACCAGCTTO 

(2) INFORMATION FOR SEQ ID N048S: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pais 
CB) TYPE: nucleic add 

(C) STRANDEDNESS: tmgfe 

(D) TOPOLOGY: 



<H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO*8S: 
AOCAGCTTGT 

(2) INFORMATION FOR SEQ ID NO:4«6 : 
0) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE DMA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:486: 

CCAGCTTGTT 

(2) INFORMATION FOR SEQ ID NO:4S7: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 disc pairs 

(B) TYPE: nucleic arid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(11) MCtECULE TYPE: UNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:437: 

CAGCTTGTTA 

® INFORMATION FOR SEQ ID N0488: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: Dnclefe add 

(C) STRANDEDNESS: itogle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
W) SEQUENCE DESCRIPTION: SEQ ID NOrtSfc 
AGCTTOTTAC 

(2) INFORMATION FOR SEQ ID NO*t89*. 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 haw pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
<P) TOPOLOGY: linear 

00 MOLECULE TYPE: UNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO:4S9: 

GCTTGlTACA 

(2) INFORMATION FOR SEQ ID NO:490: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic add 

CO STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULg TYPE: DNA 

(ri> SEQUENCE DESCRIPTION: SEQ ID NO490: 
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CTTGTTACAC 

(2) INFORMATION FOR SEQ ID N0491: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: suckle add 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(!i) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49I: 

TTGTTACACC 

(2) INFORMATION FOR SEQ ID NO:492: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 bate pain 

(B) TYPE: nucleic acid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: finear 

00 MOLECULE TYPE: DNA 

C*0 SEQUENCE DESCRIPTION: SEQ ID N0492: 

TGITACACCC 

<2) INFORMATION FOR SEQ ID N&493 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 tee pais 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N&493: 
GTTACACCCT 

(2) INFORMATION FOR SEQ ID NO:494: 

0) SEQUENCE CHARACTERISTICS! 

(A) LENGTH: lObaaa pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fines? 

00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO*94: 
TTACACCCTG 

(2) INFORMATION FOR SEQ ID NO:49S : 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
03) TYPE: nucleic add 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: Knew 

(H) MOLECULE TYPE' DNA 
(xD SEQUENCE DESCRIPTION: SEQ ID NO:495: 
TACACCCTGT 

(2) INFORMATION FOR SEQ ID NO:496: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 hoc pate 

(B) TYPE: oneiric arid 

(C) STRANDEDNESS: slngjc 

(D) TOPOLOGY: Bncar 

GO MOLECULE TYPE: DNA 
(jd) SEQUENCE DESCRIPTION: SEQ ID NO:496: 
ACACCCreiG 

(2) INFORMATION FOR SEQ ID NOW: 

CD SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 10 base pairs 
<B) TYPE: nucldc add 
CC) STRANDEDNESS: single 
(D) TOPOLOGY: 



09 MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQ ID N0:497: 
CACCCTGTGA 

(2) INFORMATION FOR SEQ ID NO:498: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 Case piss 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

cxi) sequence description: seq id no*9s: 
acoctoigao 

(2) information for seq d>no*99: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: coddc add 

(C) STRANDEDNESS: tinge 

(D) TOPOLOGY: Dnear 

(H) MOLECULE TYPE: DNA 
(ti) SEQUENCE DESCRIPTION: SEQ ID NOrf9& 
C C CTGTOAOC 



(2) INFORMATION FOR SEQ ID NO-.500: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; mtelde add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO500: 
CCTGTGAOCC 

(2) INFORMATION FOR SEQ ID NO-JO 1 : 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: oneiric add 

(C) STRANDEDNESS: single 
CP) TOPOLOGY: 1 



CO MOLECULE TYPE: DNA 
Qti) SEQUENCE DESCRIPTION: SEQ IDNO502: 
CIXnOAGCCT 

(2) INFORMATION FOR SEQ ID NQ302: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: mrckic add 

CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(si) SEQUENCE DESCRIPTION: SEQ ID NO302: 

TGTGAGCCTG . 

Q) INFORMATION FOR SEQ ID NO303 : 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
. (B)TYPE: nucleic add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(E) MOLECULE TYPE: DNA 

fed) SEQUENCE DESCRIPTION: SEQ ID NO303: 

GTGAGCCTGC 

(2) INFORMATION FOR SEQ ID NO304: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH: iOhaseptbs 
CB) TYPE: midde add 
CO SHRANDEDNESS: single 
TO TOPOLOGY: 1 



WO 95/21912 



PCT/AU95/00063 



135 



(U) MOLECULE TYPE DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NOS04: 
TGAOOCTGCA 

(2) INFORMATION FOR SEQ ID NO:S05: 

Q) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: Dnddc acid 

<C) STRANDEDNESS: sxngfc 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

0tl) SEQUENCE DESCRIPTION: SEQ ID NO-J05: 

GAGCCTGCAT 

(2) INFORMATION FOR SEQ ID NO506; 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: ilogle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:506: 

AGCCTGCATO 

(2) INFORMATION FOR SEQ ID NO307: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Uncar 

01) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO307: 

GCCTGCATGG 

C2) INFORMATION FOR SEQ ID NO308: 

00 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
P) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 

Qd) SEQUENCE DESCRIPTION: SEQ ID NOS0S: 

CCTGCATGGA 

(2) INFORMATION FOR SEQ IDNOSOfc 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:509: 
CTGCATGGAA 

(2) INFORMATION FOR SEQ ID NO310: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

04 SEQUENCE DESCRIPTION: SEQ ID NO:510: 
TGCATGOAAT 

(2) INFORMATION FOR SEQ ID N03 11 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:51l: 

GCATGGAATG 

(2) INFORMATION FOR SEQ ID NCH512: 

0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 one pans 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
p) TOPOLOGY: iniear 

Cii) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO:512: 

CATQGAATGG 

(2) INFORMATION FOR SEQ ID NO*U: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: itagk 

(D) TOPOLOGY: linear 

0Q MOLECULE TYPE: DNA 
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(xQ SEQUENCE DESCRIPTION: SEQ ID N0S13: 
ATGGAATGGA 

(2) INFORMATION FOR SEQ ID NOJ14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: midcte add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 
Od) SEQUENCE DESCRIPTION: SEQ ID N0314: 
TGGAATGGAT 

(2) INFORMATION FOR SEQ ID NO-J13: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pales 

(B) TYPE: nuddcadd 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

OS) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0-.515: 

GOAATXSGATG 

(2) INFORMATION FOR SEQ ID N0316: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: rmdeioadd 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: finear 

(B) MOLECULE TYPE: DNA 

Cx3 SEQUENCE DESCRIPTION: SEQ ID N0316: 

GAATTXJATGA 

(2) INFORMATION FOR SEQ ED N0517: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuddcadd 

(O STRANDEDNESS: single 
(D) TOPOLOGY: Qatar 

<S) MOLECULE TYPE: DNA 

fri) SEQUENCE DESCRIPTION: SEQ IDN0317: 

AATGGATGAC 

(2) INFORMATION FOR SEQ ID N0318: 

(t) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pain 

(B) TYPE: nadek add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: finetr 

(n) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOJ18: 

ATGGATGACC 

(2) INFORMATION FOR SEQ ID NCH5I9; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nndefe add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: finear 

fri) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0319: 

TQQATGACOC 

(2) INFORMATION FOR SEQ ID NOJ20: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Goear 

<u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO520: 

GGATOACCCT 

(2) INFORMATION FOR SEQ IDN032I: 

(i) SEQUENCE CHARACTERISTICS: 
<A) LENGTH: 10 base pais 

(B) TYPE: nncldc add 

(C) STRANDEDNESS: stogie 

(D) TOPOLOGY; Bnear 

©MOLECULE TYPE: ENA 
(xi) SEQUENCE DESCRIPTION. SBQIDN0521: 
GATGACCCTO 

(2) INFORMATION FOR SEQ ID NOS22: 



(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pais 

(B) TYPE: mretete add 

(C) STRANDEDNESS: angle 

(D) TOPOLOGY: Qzxar 

CO MOLECULE TYPE: DNA 

^d) SEQUENCE DESCRIPTION: SEQ IDN0322: 
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ATGACGCTGA 

(2) INFORMATION FOR SEQ ID N0323: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TOPE: DNA 
W SEQUENCE DESCRIPTION: SEQ ID NQ523: 
TGACCCTGAG 

(2) INFORMATION FOR SEQ ID N0324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: imcldc add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(D) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0524: 

QAOCCTGAGA 

(2) INFORMATION FOR SEQ ID NO:525: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pahs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(Jtl) SEQUENCE DESCRIPTION: SEQ ID NO£2S: 

ACCCTQAQAG 

(2) INFORMATION FOR SEQ ID NO-J26: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(G) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(tl) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0326: 

CCCIGAGAGA 

(2) INFORMATION FOR SEQ ID N0327: 

0) SEQUENCE CHARACTERISTICS: 
'(A) LENGTH* 10 bate pain 
(B> TYPE: nndde add 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 

Od) SEQUENCE DESCRIPTION: SEQ ID N0327: 

CCTGAGAGAG 

(2) INFORMATION FOR SEQ ID NO-528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pairs 

(B) TYPE: ouefde add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52& 
CTGAGAGAGA 

(2) INFORMATION FOR SEQ ID NO-529: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0529: 
TGAGAGAOAA 

(2) INFORMATION FOR SEQ ID NO:330: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nudeio add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQ ID NO330: 
GAGAOAGAAG 

(2) INFORMATION FOR SEQ ID ■ 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nudric acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bncar 

(B) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQIDN0531: 
AGAGAGAAGT 
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(2) INFORMATION FDR SEQ IDN0532: 

(0 SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 tee pahs 

(B) TYPE: nucleic arid 

<C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: UNA 

(Jd) SEQUENCE DESCRIPTION: SEQ ID NO*S32: 

GAGAGAAGTQ 

(2) INFORMATION FOR SEQIDN0533: 

(0 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: few 

00 MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTTON: SEQ IDN0333: 
AGAGAAGTGT 

Q) INFORMATION FOR SEQ ID N0534: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: micldo acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<3) MOLECULE TYPE: 0NA 
(sQ SEQUENCE DESCRIPTION: SEQ ID NOS34: 
GAGAAGTGTT 

(2) INFORMATION FOR SEQ IDN0335: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE noctefe add 

<Q STRANDEDNESS: single 
(D) TOPOLOGY: 



(u) MOLECULE TYPE: DNA 
M SEQUENCE DESCRIPTION: SEQ ID N0335: 
AGAAGTGTTA 

(2) INFORMATION FOR SEQ ID N0536: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: tmd etc acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(IQ MOLECULE TYPE* DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO-536: 
GAAGTX3TTAG 

(2) INFORMATION FOR SEQ ID N0537: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mrrinc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 

(xD SEQUENCE DESCRIPTION: SEQ ID N0537: 

AAGTOTTAGA 

(2) INFORMATION FOR SEQ ID NO-J38: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

<xl) SEQUENCE DESCRIPTION: SEQ IDN0538: 

AGTGTTAGAO 

(2) INFORMATION FOR SEQ ID NOS39: 

©SEQUENCE CHARACTERISTICS: 

(A) LENG TH: 10 base pairs 

(B) TYPE: imdek add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: ] 



m MOLECULE TYPE: DNA 
(si) SEQUENCE DESCRIPTION: SEQ ID N0539: 
GTGTTAGAGT 

(2) ftiFORMATION FOR SEQ ID NO540: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IO base pairs 

(B) TYPE: SDclebacid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: liccax 

01) MOLECULE TYPE: DNA 
(id) SEQUENCE DESCRIPTION: SEQ IDNO540: 
TGTTAGAGIG 

(2) INFORMATION FOR SEQ ID NO*S4I: 
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(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pais 

(B) TYFB; nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO ^41; 

GTTAGAGTGG 

(2) INFORMATION FOR SEQ ID NO:542: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 
<D)T0KHjOOY: linear 

(H) MOLECULE TYPE DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0542: 

TTAGAGTGGA 

(2) INFORMATION FOR SEQ ID NG343 : 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQIDN0543: 
TAGAGTGGAG 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:545: 
GAGTGGAGGT 

(2) INFORMATION FOR SEQ ID N054& 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan* 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(3d) SEQUENCE DESCRIPTION: SEQ ID NO:546: 

AGTGGAGGTT 

(2) INFORMATION FOR SEQ ID NO:547; 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base paid 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: atage 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: S£QIDNCh547: 

GTGGAGGTTT 

(2) INFORMATION FOR SEQ ID N054& 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10b*Jcpair» 

(B) TYPE: nudefe add 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 



(2) INFORMATION FOR SEQ ID N0344: 



00 MOLECULE TYPE: DNA 



(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
(D) TYPE: nucleic acid 
(G) STRANDEDNESS: single 
CD) TOPOLOGY: finear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDN0344: 

AGAGTOGAGG 



(2) INFORMATION FOR SEQ ID N0345: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucldc add 

(Q STRANDEDNESS: Single 
(D) TOPOLOGY: linear 

CO MOLECULE TYPE: DNA 



(xi) SEQU&iCE DESCRIPTION; SBQ ID N0548: 
TGGAGGTTTG 

(2) INFORMATION FOR SEQ ID NO:549: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base paid 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

0d) SEQUENCE DESCRIPTION: SEQ ID NO:549: 

GGAGGTTTGA 

(2) INFORMATION FOR SEQ IDNO350: 
CO SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 10b*septiiS 

(B) TYPE* micldc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DMA 
W SEQUENCE DESCRIPTION: SEQ ID NO3S0: 
GAGGTTTGAC 

(2) INFORMATION FOR SEQ ID N0351: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Enear 

(IT) MOLECULE TYPE: DNA 
fri) SEQUENCE DESCRIPTION: SEQIDNOJ51: 
AGGTTTGACA 

(2) INFORMATION FOR SEQ ID N0352: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 toe pans 

(B) TYPE: oucldo acid 

(C) STRANDEDNESS: stogie 

(D) TOPOLOGY: 



(ii) MOLECULE TYPE: DNA 
W) SEQUENCE DESCRIPTION: SEQ ID N0352: 
GGTTTOAGAG 

CO INFORMATION FOR SEQ ID ND353 : 

(I) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 
(B) TYPE: mxtetc add 
(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECUUE TYPE: DNA 

M SEQUENCE DESCRIPTION: SEQ IDNO-J53: 

GTITQACAGC 

(2) INFORMATION FOR SEQ ID N0554: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH' 20 base pain 

(B) TYPE: noctd c add 

(O STRANDEDNESS: stogie 
(D) TOPOLOGY: Hoear 

00 MOLECULE TYPE DNA 

frQ SEQUENCE DESCRIPTION: SEQ ID NOS54: 
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TTTGACAGCC 

(2) INFORMATION FOR SEQ ID N0355: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- 10 base pate 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: 1 



(fi) MOLECULE TYPE: DNA 
(xf) SEQUENCE DESCRIPTTON: SEQ ID NO:555: 
TTCACAGCCG 

Q) INFORMATION FOR SEQ ID NOS56: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pain 

(B) TYPE: nudric add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: 1 



Qi) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOc5S6: 
TGACAGCCGC 

(2) INFORMATION FOR SEQ ID N0357: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
(H) TYPE: ttttjri c add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0557: 

GACAGCCGCC 

(2) INFORMATION FOR SEQ ID NOS5& 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: Dscfefc add 

(Q STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(h) SEQUENCE DESCRIPTION: SEQ ID N055S: 

acagcogoct 

0) information for seq id n0559: 

0) sequence characteristics: 

(A) LENGTH: 10 base pahs 

(B) TYPE nacleic arid 
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(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: UNA 
(») SEQUENCE DESCRIPTION: SEQ ID NO*59: 
CAGCCGCCTA 

(2) INFORMATION FOR SEQ ID NO360: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: ouclck told 

(Q STRANDEDNESS: angle 
(D) TOPOLOGY: fines 

(H) MOLECULE TYPE DNA 

(«) SEQUENCE DESCRIPTION: SEQ ID NOS60: 

AGOOGCCTAG 

CO INFORMATION FOR SEQ ID N0561: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bee pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: lingie 
(D) TOPOLOGY: linear 

(B) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:S61: 

GCCGCCTAGC 

(2) INFORMATION FOR SEQ ED NO:562: 

(0 SEQUENCE CHARACTBRKUCS: 

(A) LENGTH: 10 base pain 

(B) TYPE: Bockic acid 

(C) STRANDEDNESS: ringk 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
<xl) SEQUENCE DESCRIPTION: SEQ ID NO-J62: 
OCGCCTAGCA 

(2) INFORMATION FOR SEQ ID NO-^63 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE* oneiric acU 

<R STRANDEDNESS: stagk 
(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

(id) SEQUENCE DESCRIPTION: SEQ ID NOJ63: 

OOOCTAGCAT 



(2) INFORMATION FOR SEQ ID NO:364: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: ibigk 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:564: 

GCCTAGCATT 

(2) INFORMATION FOR SEQ ID N0365: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH- 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: finear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0565: 

CCTAGCATTT 

<2) INFORMATION FOR SEQ ID N0366: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: onckk acid 

(Q STRANDEDNESS: aingk 
(D) TOPOLOGY: Qnear 

01) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS66: 
CTAGCATTTC 

<2) INFORMATION FOR SEQ ID NOSCT: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
' (B)TYPE: nndcteecid 
(Q STRANDEDNESS: dngle 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA * 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0567: 

TAGCATTTCA 

(2) INFORMATION FOR SEQ ID N0368: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: onckk acid 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: I 
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(5) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOS68: 
AGCATTTCAT 

(2) INFORMATION FOR SEQ ID NO:56* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOtwsepaiis 

(B) TYPE: Bndefeadd 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: Bnear 

(fl) MOLECULE TYPE: DMA 
(») SEQUENCE DESCRIPTION: SEQ ID N0369: 
GCATTTCATC 

(2) INFORMATION FOR SEQ ID NO370; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 two pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tinge 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOJ70: 
CATTTCATCA 

(2) INFORMATION FOR SEQ ID NCk571: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 tec pair* 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Linear 

(II) MOLECULE TYPE: DNA 
(ti) SEQUENCE DESCRIPTION: SEQIDND37I: 
ATTTCATCAC 

(2) INFORMATION FOR SEQ ID NCfc5T2: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base piss 

(B) TYPE: nodctc acid 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

ed) SEQUENCE DESOUPTION: SEQ IDNQ572: 

TTTCATCACG 

(2) INFORMATION FOR SEQ IDNOS73: 



(I)SBQUDJCE CHARACTERISTICS: 

(A) LENGTH 10 base pain 

(B) TYPE* nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOS73; 

TTCATCACGT 

(2) INFORMATION FOR SEQ ID N0374: 

(I) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic arid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
fri) SEQUENCE DESCRIPTION: SEQ ID N0374: 
TCATCACGTO 

(2) INFORMATION FOR SEQ ID NO*?3: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 
(Q.STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DMA 

(xi) SEQUENCE DESCRIPTION: SEQIDNO*J75: 
CATCACGTGG 

(2) INFORMATION FOR SEQ n>NOS76: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE nuride add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bnear 

(u) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID N0576: 
ATCAOGfTOGC 

(2) INFORMATION FOR SEQ IDN0377: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic arid 

(Q STRANDEDNESS: stage 
(D) TOPOLOGY: 1 

HQ MOLECULE TYPE: DNA 
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(xi) SEQUENCE ASCRIPTION: SEQ ID NOJ77: 
TCACGTGGOC 

(2) INFORMATION FOR SEQ ID WJ78: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO:578: 

CACGTOOCCC 

(2) INFORMATION FOR SEQ ID N0579: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nocfefc icsd 

(Q STRANDEDNESS: lingle 
(D) TOPOLOGY: In 



<il) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ H>NO:579: 
ACCTGGCCCG 

(2) INFORMATION FOR SEQ ID N03SO: 

CD SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 bate pain 
O) TYPE: nudek add 

(C) STRANDEDNESS: tingle 
CD) TOPOLOGY: Hocar 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO380: 
CGTGGCCCGA 

(2) INFORMATION FOR SEQ ID N0581: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: xmddcadd 

(Q STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
m SEQUENCE DESCRIPTION: SEQ ID NO*Sl: 
GTGGCCCGAG 

a) INFORMATION FOR SEQ ID NOSS2: 

0} SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: midcic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION; SEQ ID N0382: 

TQGCCCGAGA 

p) INFORMATION FOR SEQ ID N03S3: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
P) TOPOLOGY: Hocer 

(S) MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:S83: 

GGCCCGAGAG 

C2) INFORMATION FOR SEQ ID NOsSM: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: micleio acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Linear 

00 MOLECULE TYPE- DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDN0584: 

GCCCGAGAGC 

CD INFORMATION FOR SBQ XDNOdSS: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucfek add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 
0c0 SEQUENCE DESCRIPTION: SEQ ID NOJS5: 
CCCGAGAGCT 

(2) INFORMATION FOR SEQ ID N038& 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nodeic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Enear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID N05S& 
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CCGAGAGCTG 



Q) INFORMATION FOR SEQIDN05S7: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pais 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: 



(3) MOLECULE TYPE: DNA 
(Jtl) SEQUENCE DESCRIPTION: SEQ ID N0347: 
CGAGAGCTGC 

(2) INFORMATION FOR SEQ ID NO-.588: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: ENA 

(xl) SEQUENCE DESCRIPTION: SEQ ID N0588: 

GAGAGCTGCA 

(2) INFORMATION FOR SEQ ID NOJ 89: 

CD SEQUENCE CHARACikKJSTlCS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(ad) SEQUENCE DESCRIPTION: SEQ ID NQ589: 

AQAOCTOCAT 

(2) INFORMATION FOR SEQ IDNO590: 

(I) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
<B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ ID N&.590: 
QAOCTOCATC 

(2) INFORMATION FOR SEQ ID NOc591: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: node* add 



(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0391: 
AGCTGCATCC 

C2) INFORMATION FOR SEQ IDN0391 

(!) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(B) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOS92: 
GCTGCATCCG 

(2) INFORMATION FOR SEQ ID NO:S93; 



0) SEQUENCE CHARACT ERIS T IC S: 

(A) LENGTH: 10 base pain 

(B) TYPE: ssdde add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION; SEQ ID N0393: 
CTGCATCCGG 

(2) INFORMATION FOR SEQ IDN0394: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan 

(B) TYPE; nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 

Ocf) SEQUENCE DESCRIPTION: SEQ ID NO:594: 

TGCATCCGGA 

(Z) INFORMATION FOR SEQ ID N039S: 

(j) SEQUENCE CHARAClbKISIiCS: 

(A) LENGTH: lObuepsia 

(B) TYPE: oneiric odd 

CQ STRANDEDNESS: tingle 
(D) TOPOLOGY: finear 

GO MOLECULE TYPE: DNA 

(xl) SEQUENCE DESCRIPTION: SEQIDN0395: 

GCATCOGGAG 
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(2) INFORMATION FOR SEQ ID 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: ncdelc icld 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 1 



(ii) MOLECULE TYPE: UNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOJ9& 
CATCCGOAGT 

(2) INFORMATION FOR SEQ ID NO:597: 

ffl SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
ffl) TYPE nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 
fcri) SEQUENCE DESCRIPTION: SEQ ID N0597: 
ATOCOOAGTA 

(2) INFORMATION FOR SEQ ID NOe598: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH- 10 base pus 

(B) TYPE* nndcfc idd 

(Q STRANDEDNESS: stogie 
(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNQ398: 

TCCGGAGTAC 

(2) INFORMATION FOR SEQ ID NO:599: 

0) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYFE nucleic add 

(C) STRANDEDNESS: lingj* 
(P) TOPOLOGY: linear 

(H) MOLECULE TYPE DNA 

frj) SEQUENCE DESCRIPTION: SEQ ID HOS99: 

CCGGAGTACT 

(2) INFORMATION FOR SEQ IDNO16OO: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pairs 

(B) TYPE nucieic acid 

(O STRANDEDNESS: single • 

(D) TOPOLOGY: linear 



(D) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NOtfOO: 
CGGAGTACTT 

(2) INFORMATION FOR SEQ IDNO:601: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 baic pairs 

(B) TYPE nucleic acid 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ Q> NOrfOl: 

GGAGTACTTC 

(2) INFORMATION FOR SEQ ID NO:602: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pain 

(B) TYPE mir.frfc add 

<C) STRANDEDNESS: stogie 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6Q2: 

GAGTACTTCA 

(2) INFORMATION FOR SEQ ID NO*03: 

(i) SEQUENCE CHARACIERISTICS: 

(A) LENGTH: 10 base pair* 

(B) TYPE nucleic add 

(O STRANDEDNESS; tingle 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO*03: 

AGTACTTCAA 

C2) INFORMATION FOR SEQ ID NO£04: 

(0 SEQUENCE CHARACTERISTICS: 
CA)LENOTH: 10 bafopaha 
(B) TYPE nucleic add 
(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

flD MOLECULE TYPE DNA 

(xi) SEQUENCE DSCRTJTION: SEQlDNCh604: 

GTACTTCAAG 

(2) INFORMATION FOR SEQ ID NOtfOS: 
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(0 SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 toe pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: tear 

(H) MOLECULE TYPE: DNA 
(n) SEQUENCE DESCRIPTION: SEQ ID NO605: 
TACTTCAAGA 

(2) INFORMATION FOR SEQ ID NQ*06: 

0) SEQUENCE CHARAU bkATlICS: 

(A) LENGTH: lObasapiiis 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
P) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

fcri) SEQUENCE DESCRIPTION: SEQ ID NO*06: 

ACTTCAAGAA 

a) INFORMATION FOR SEQ ID NO:6D7: 

CO SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 

(B) TYPE: middc acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
Co) SEQUENCE DESCRIPTION: SEQ ID NOrfOT: 
CnCAAGAAC 

(2) INFORMATION FOR SBQ fl>NO£08: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGflR 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: SNA 
(at) SEQUENCE DESCRIPTION: SEQ ID NOS08: 
TICAAGAACT 



(A) LENGTH: 10 base] 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQIDNOrfOfc 
TCAAGAACTG 

(2) INFORMATION FOR SEQ ID NOrtlO: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 ban pans 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:610: 

CAAGAACTGC 

(2) INFORMATION FOR SEQ ID NO£l 1: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nndeio add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

CD) MOLECULE TYPE: DNA 

fr?) SEQUENCE DESCRIPTION: SEQ ID NOtfl I: 

AAGAACTGCT 

(2) INFORMATION FOR SEQ ID NOrf 12: 

(0 SEQU&tCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
03) TGPOUOGY: Gnear 

09 MOLECULE TYPE: DMA 

(Xi) SEQUENCE DESCRIPTION: SEQIDNOrfl2: 

AGAACTGCTG 



(2) INFORMATION FOR SEQtDNO*13: 

0) SEQUENCE CHARACTERISTICS: 
0) INFORMATION FOR SBQ IDNOrfft (A) LENGTH: 10 base pans 

(B) TYPE: nudcic add 

(i) SEQUENCE CHARACTERISTICS: <C) STRANDEDNESS: itatfe 
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(D) TOPOLOGY: Unco- 
CD MOUECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:<513: 
GAACTGCTGA 
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(2) INFORMATION FOR SBQ ID NO:614: 

(i) SEQUENCE CHARACTERISTICS ; 

(A) LENGTH: 1305 base pairs 

(B) TYPSi nucl eic acid 

(C) STRAMDEDNESS : single 

(D) TOPOLOGY i linear 

(li) KOLBCDLS TYPB: OTA 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 614: 

GAAGAQATTT GGGAGAACAT GACCTGGATG CAGTGGOAAA AAGAAATTCA CAATCACACA 60 

AAAIACATAT ACTCCTTACT TOAAAAATCG CAGAACCAAC AASAAAAGAA TGAACAAGAA 120 

CTATTGGAAT TGGATCAATG GGCAAGTTTG TGGAATTGGT TTGACATAAC AAAATGCCTO 180 

TGGTATATAA AAATATTCAT AATGGTAGTA GGAGOCTTGA TASGTTTAAC AATAOTTTTT 240 

GCTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGOATACT CACCATTGTC GTTTCAGACC 300 

CTCCTCCCAA CCCCGAGGGG ACCCOACAGG CCOGAAGGAA TCGAAGAAGA AGGTGGAGAG 360 

AGAGACAGAG ACAOATCCAC TCGATTAOTA CACGGATTCT TAOCACTTTT CTGGGACGAC 420 

CTGAGGAGCC TGTGCCTCTT CCTCTACCAC CACTTGAGAG ACTTACTCTT GATTOTAACA 4B0 

AGGATTGTGG AACTTCTGGG ACGCAGGGGA TGGGAAGCCC TCAAATATTG GTGGAACCTC 540 

CTAAAGTATT GGAGCCAGGA ACTCCMAAG AGTGCTGTTA TCTTGCTCAA TGCCACCGCC €00 

ATAGCAGTAG CTOAGGGGAC AGATAGAGTT TTAOAAGTAT TACAAAGAGC TTATAOAGCT €60 

ATCCTCCACA TACCTAGAAG AAXAAOACAG GGCCTCGAAA TGGCTTTGCT ATAAAATGGG 720 

TGGCAAGTGA GCAAAAAGTA GTGTAGTCAG ATAGCATGCA TCATAAGGGG TGGGGGCCAA 780 

CAACTAACAA TGCTGATCGT GCCTGGCTAG AAGCACAAGA GAAGGAAGAA GCGGGTTTTC 840 

CAGTCAAACC TCAGGTAGCT GTAGATCTTA GCCACTTTTT AAAAGAAAAG GGGGGACTOG 900 

AAGGGCTAAT TCACTCCCAA AGAAGACAAG ATACACAGTG CTGCAAACTA TTACCAGTGG SCO 

AGTCAGCOAA GATAGAAGAG OCCAATGGAG GAGAAAACCA CAGATTGTTC TGTTGGGGAC 1020 

TTTCCATCOG TTGGGGACTT TCCAAGQCGG GGTGGCCTGG GTGACTAGTT CCGGTGGGOA 1080 

CTTTCCAAGA AGGOGCGGCC TGGGCGGGAC TGGOGAGTGG CGAGCCCTCA GATGCTGCAT 1140 

AXAAGOIGCT GCTTTCTGCT GTTACTGGGT CTCTOGGGTT AGACCAGATC TGAGCCTGGG 120.0 

AGCTCTCTGG CTAACTAGGG AACCCACTGC TTAAGCCTCA AEAAAGCTTG CCTTGAGTGC 1260 

TTCAAGTAGT GTGTGCCOGT CTGTTOTGTG ACTCTGGTAT CTAGA 130S 
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(2) INFORMATION FOR SEQ ID NO] €15: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE99 : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 1 

OAAACAATTT GGGATAAC&T GACCTGGATG CAGTGGGAAA GAGAAATTGA CAATTACACA 60 

AACATAAXAT ACACCTTAAT TGAAGAATCG CAGAACCAAC AAGAAAAAAA TGAACTAOAA 120 

TTATTGGAAT TGGATAAATQ G6CAAATTTG TQGAATTGGT TTACTATATC AAACT6GCTA 180 

TGGTATATAA AATTATTCAT AATGGTAOTA GGAG6CTTG0 TAGGTTTAAG AATAQTTTTT 240 

ACTGTACTTT CTAXAGTGAA TAGAGTTAGG CAGGGATACT CACCATTGTC GTTTCAGACC 300 

CACCTCCCAA CCCCGAAGGG ACCCGACAGG CCAGAAGGAA TOTAAGAAGA AGGTGGAGAG 360 

AGAOAGAOAG GCAGCTCCAC TCGATTAGTG CACGGATTCT XAGCACTTTT CTGGGACGAC 420 

CTGAGGAGTC TGTGCCTCTT CAGCTACCAC CACTTGAGAG ACTTACTCTT GATTGTAACG 480 

AGGATTGTGG AACTTCTGGG ACGCAGGGGA TGGGAAGCCC TCAAATACTG GTGOAATCTC 540 

CTGCAGTATT GGAGGCAGGA ACTACAGAAG AGTGCTGTTA GCTTGTTCAA TGGCACGGCC 600 

ATAGCAGTAQ CTGAGGGGAC AOATAGAGTT ATAGAAGCTT TAOGAAGGGC TTATAGAGCT 660 

ATTCTCCACA TACCTAGAAG AATAAGACAG OGCTTAOAAA GGGCTTTGCT ATAAAATGGG 720 

TGGCAAGTGG TCAOAAAGTA GTOTGGTTAG AAGGCATGTA CCTTTAAGAC AAGGCAGCTA 780 

TAGATCTTAG CC tt C TT T TTA AAAGAAAAGG GGGGACTGGA AGGGCTAATT CACTCACAGA B40 

GAAGATCAGT TQAACCAGAA GAAGATAGAA GAGGCCATGA AGAAGAAAAC AACAGATTGT 900 

TCCGTTTGTT CCGTTGGGGA CTTTCCAGGA GACGTGGCCT GAGTOATAAG CCGCTGGGGA 960 

CTTTCCGAAG AOGCGTGACG GGACTTTCCA AGGCGAGGTG OCCTQGaCGG GACTGGGGAG 1020 

TGGOGAGCCC TCAGATGCTG CATATAAGCA GCTGCTTTCT GCCTGTACTG GGTCTCTCTG 10B0 

GTTAGACCAG ATCTGAGCCT GGGAOCTCTC TGGCTAACTA GGGAACCCAC TGCTTAAGOC 1140 

TCAATAAAGC TTGCCTTGAG TGCTTCAAQT AGTGTGTGCC COTCTOTTGT GTGACTCTGG 1200 

TATCTAGA 120B 
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(2) INFORMATION FOR SEQ ID NO* 16: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bote pain 

(B) TYPE; nucleic add 

(C) STRANDEDNESS: tingle 
p) TOPOLOGY: Knwx 

(li) MOLECULE TYPE: DNA 

Od) SEQUENCE DESCRIPTION: SEQ ID NO*)6: 

TOQAAOOGCT AATTTGGT 18 



(2) INFORMATION FOR SEQ ID NQrfl?: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pun 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

® MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID N0:617: 

ATCTTOCCTA AAAAATTAGC CTGTC 25 



(2) INFORMATION FOR SEQ ID NCH518: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base paiff 

(B) TYPE: mickic add 

(C) STRANDEDNESS: dogfe 

(D) T0PO5LOGY: Bnear 

(fi) MOLECULE TYPE: DNA 
(xj) SEQUENCE DESCRIPTION: SEQ ID NO* 18: 
AGOCTCAQAT CTGGTCIAAC 20 



(2) INFORMATION FOR SEQ ID NO* 19: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LBKHH: 20 ba» pain 

(B) TYPE: oneiri c add 

(Q STRANDEDNESS: single 
TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDNOrtl* 



AGCAGCAGGA AGCACTATGG 



20 



WO 95/21912 



- 151 - 



PCT/ATJ95/00063 



(2) INFORMATION FOR SEQ ID NOrffth 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 19 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

(XI) SEQUENCE DESCRIPTION: SEQ ID NO320: 

TGCTAGAGAT TTTCCACAC 19 



(2) INFORMATION FOR SEQ ID NO-.621: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 tropin 

(B) TYPE: nucleic add 

(€) STRANDEDNESS: single 
(D) TOPOLOGY: tear 

(U) MOLECULE TYPE; DNA 

(d) SEQUENCE INSCRIPTION: SBQ ID N04Q1: 

AGTQAATAGA GTTAGGCAGG 20 



(2) INFORMATION FOR SEQ ID N0:622: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pan 

(B) TYPE: futdefc add 

. (Q STRANDEDNESS: tingle 
09 TOPOLOGY: IfaM 

(U) MOLECULE TYPE: DNA 

(n) SEQUENCE DESCRIPTION: SEQ IDNO&2: 

GTAAGACAGT ATGATCA0AT A 21 



C2) INFORMATION FOR SEQ ID NOtf23: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 21 base pain 

(B) TYPE: mukto add 

(CJ STRANDEDNESS; tingle 
(D) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SEQ IDN0323: 



TTGTAGGGAA TTCCAAATTC C 



21 
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(2) INFORMATION FOR SEQ ID N0624: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base fobs 

(B) TYPE; nudcic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(«) MOLECULE TYPE: UNA 
W SEQUENCE DESCRIPTION: SEQ IDNO£24: 
CAGGATOCTA CACCTUTCAA CATAAT 26 



(2) INFORMATION FOR SEQ ID N0325: 

(Q SEQUENCE CHARACTERISTICS; 
(A) LENGTH: 22 base pate 
O) TYPE: nucleic add 
(Q) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ti) MOLECULE TYPE: DNA 

(3d) SEQUENCE DESCRIPTION: SEQ EDNOrf2S: 

GGGAATTCCF TATTOCIGCT TG 22 



(2) INFORMATION FOR SEQ 1DND£26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 test pain 
<H) TYPE: nodefc add 
CQ STRANDEDNESS: dntfe 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

OcD SBQUQ2CE DESCRIPTION: SEQ ID N0t£26: 

CCAGAAGTTC CACAATCC IS 



C2) INFORMATION FOR SEQ ID NO*27: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pans 

(B) TYP£: nucleic add 

(Q STRANDEDNESS: tingle 
CD) TOPOLOGY: linear 

(0) MOLECULE TYPE: DNA 

On) SEQUENCE DESCRIPTION: SEQ ID NO*27: 



TTCITCTAGG TATGIGGAG 



19 
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(2) INFORMATION FOR SEQ ID NO:628: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 hue pain 

(B) TYPE nucleic acid 

(Q STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: DMA 

0d) SEQUENCE DESCRIPTION: SEQ ID NO:628: 

AGTGAATTAG CCCTTCCAO 19 



(2) INFORMATION FOR SEQ ID NOrf29: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(n) SEQUENCE DESCRIPTION: SEQ IDNO:629: 

TOCTAGAGAT TTTCCACAC 19 , 



Q) INFORMATION FOR SEQ ID NO*30: 

(D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 171mm pain 

(B) TYPE: nucfeie acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bnear 

(IQ MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQIDNO:630: 
TGCTCTGGAA AACTCAT 17 



(2) INFORMATION FOR SEQ ID NO*3 1 : 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bate pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Qatar 

00 MOLECULE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SBQ ID NOttl: 



CTTTCTAXAO TGAATAGAO 



19 
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(2) INFORMATION FOR SEQ ID HO: 632: 

(i) SEQUENC E CHA RACTERISTICS : 

(A) LEN GTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO: 632 j 
TATTGGAGTC AOQAACT 

(2) INFORMATION FOR SEQ ZD N0 1 633: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 17 base pairs 
<B) TYPE i nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633: 
GGTCTAACCA GAGAGAC 



<2) INFORMATION FOR SEQ ID NO: 634: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 1 

Ala Val Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala 
1 5 io 



(2) INFORMATION FOR SEQ ID HO*635: 

(i) SEQUENCE CHARACTERISTICS: 

(A) leng ths 36 amino acids 

(B) TYPE: amino acid 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635: 

Pro Thr Ser Gin Ser Arg Gly Asp Pro Thr Gly Pro Lye Glu Lys 
1 5 ' io 15 

Lys Lys Val Glu Arg Glu Thr Glu Tnr Asp Pro Pfaa Asp Thr 
20 25 ao 

Asp Pro His 

35 



17 



17 
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(2) INFORMATION FOR SEQ ID NO: 636: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH t 3$ amino acids 

(B) TYPEt amino acid 

(C) STRANDBDNESS : single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO 1 636* 

Pro Ser Ser Gin Pro Arg Gly Asp Pro Thr Gly Pro Lys Glu Ser Lys 
1 5 10 15 

Lys Lys Val Glu Arg Glu Thr Glu Thr Asp Pro Leu Asp Tyr Thr 
20 25 30 

Asp Ser His 
35 



(2) INFORMATION FOR SEQ ID NO J 637: 

ii) SEQUE NCE CHARACTERISTICS : 

(A) LENGTH) 36 amino aoids 
<B) TYPE i amino acid 

(C) STRANDBDNESS i single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID 210:637: 

Pro Thr Ser Gin Pro Arg Arg Asp Pro Thr Gly Gin Lys Glu Ser Lys 
15 10 15 

Lys Lys Val Glu Arg Glu Thr Glu Ala Ala Pro Leu Asp Cys Thr 
20 25 30 

Asp Ser His 
35 



(2) INFORMATION FOR SEQ ID NOi63B: 

(1) SEQUENC E CHARACTERISTICS: 

(A) LENGTH: 110 amino aoids 

(B) TYPE i amin o acid 

(C) STRANDEDNBSSj single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPEt protein 

(xi) SEQUENCE DESCRIPTION: SEQ XD NO: 63 8 : 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
15 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin lie His Ser lie Ser Glu 
20 25 30 

Arg lie Leu Ser Thr Tyr Leu Gly Arg Ser Ala Glu Pro Val Pro Leu 
35 40 45 

Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys *sn Glu Asp Cys 
50 55 60 

Gly Thr Ser Gly Thr Gin Gly Val Gly Ser Pro Gin He Leu Val Glu 
€5 70 75 B0 
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Ser Pro Thr Val Leu Glu Ser Gly Thr Lys Glu Cys Cys Leu 

85 90 95 

Ala Gin Cys His Ser His Ser Ser Ser Gly Asp Arg 

100 105 110 



(2) INFORMATION FOR SEQ ID NO: 639: 

(i) SEQUE NCE CH ARACTERISTICS; 

(A) LENGTH: 110 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESSs single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPEi protein 

(Xi) SEQUENCE DESCRIPTION i SEQ ID NO: £39: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
1 5 10 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin lie His Ser lie Ser Thr 
20 25 30 

Arg lie Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Pro Val Pro Leu 
35 40 45 

Pro Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Lys Asp Cys 
50 55 60 

Gly Thr Ser Gly Thr Gin Gly Net Gly Ser Pro Gin He Leu Val Glu 
65 70 75 B0 

Pro Pro Lys Val Leu Glu Pro Gly Thr Ala Glu Glu Cys Cys Tyr Leu 
85 90 95 

Ala Gin Cys His Arg Bis Ser Ser Ser Gly Asp Arg 

100 X05 HO 



(2) INFORMATION FOR SSQ ID NO: 640: 

(i) SEQUENCE CH ARACTERISTICS i 

(A) LEN GTH? no amino aoids 

(B) TYPE; amino acid 

CO STRANDBDNBSSi single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTIONS SEQ ID NO: 640: 

Asp Pro Pro Pro Asn Pro Glu Gly Thr Arg Gin Ala Arg Arg Asn Arg 
1 S 10 • 15 

Arg Arg Arg Trp Arg Glu Arg Gin Arg Gin Leu His Ser lie Ser Ala 
20 25 30 

Arg He Leu Ser Thr Phe Leu Gly Arg Pro Glu Glu Ser Val Pro Leu 
35 40 45 

Gin Leu Pro Pro Leu Glu Arg Leu Thr Leu Asp Cys Asn Glu Asp Cys 
50 55 60 

Gly Thr Ser Gly Thr Gin Gly Met Gly Ser Pro Gin He Leu Val Glu 
*5 70 75 80 

Ser Pro Ala Val Leu Glu Ala Gly Thr Thr Glu Glu Cys Cys Leu 
85 90 95 
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Val Gin Txp Hla Gly His Ser Ser Ser Gly Asp Arg 

100 105 110 



U) XNFORMftTIOH FOR SBQ ID NO ! 641: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH i 23? amino acids 

(B) TYPE: amino acid 

<C) STRAHDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 

ixi) SEQUENCE DESCRIPTION: 9EQ ID NO: 641: 

Glu Gin lie Trp Asn Asn Met Thr Trp Met Glu Txp Asp Arg Glu He 
1 5 10 15 

Asn Asn Tyr Thr Ser Leu lie His Ser Leu lie Glu Glu Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala 
35 40 45 

Ser Leu Trp Asn Trp Phe Asn lie Thr Asn Txp Leu Txp Tyr lie Lys 
50 55 60 

Leu Phe He Met He Val Gly Gly Leu Val Gly Leu Arg lie val Phe 
65 70 75 60 

Ala Val Leu Ser He val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 

05 90 95 

Ser Phe Gin Thr His Leu Pro lie Pro Arg Gly Pro Asp Arg Pro Glu 
100 105 110 

Gly lie Glu Glu Glu Gly Gly Glu Arg Asp Arg Asp Arg Ser He Arg 
115 120 125 

Leu Val Aan Gly Ser Leu Ala Leu He Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cys Leu Phe Ser Tyr His Arg Leu Arg Asp Leu Leu Leu He Val Thr 
145 ISO **' 15S 160 

Arg He Val Glu Leu Leu Gly Arg Arg Oly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Gin Tyr Trp Ser Gin Glu Leu Lys Asn Ser Ala 
180 185 190 

Val Asn Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val lie Glu Val Leu Gin Ala Ala Tyr Arg Ala He Arg His He 
210 215 220 

Pro Arg Arg Ha Arg Gin Gly Leu Glu Arg He Leu Leu 
225 230 235 
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(2) INFORMATION FOR SEQ ID NO r 642 : 

(i) SEQUE NCE CH ARACTERISTICS : 

(A) LENGTH: 237 amino acids 

(B) TTPE: amin o acid 

(C) STRANDEBNESS: single 

(D) TOPOLOGYi linear 

{ii} MOXiECOLB TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: £42: 

Glu Glu He Trp Glu Asn Met Thr Trp Met Gin Trp Glu hym Glu He 
1 5 10 15 

His Asn His Thr Lye Tyr He Tyr Ser Leu Leu Glu Lys Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Gin Glu Leu Leu Glu Leu Asp Gin Trp Ala 
35 40 45 

Ser so** Asn ^f** Ile Tar L y" Tr P Leu T^P Tyr lie Lys 

He Phe He Met Val Val Gly Gly Leu He Gly Leu Arg Ile Val Phe 
*5 70 75 B0 

Ala Val Leu Ser He Val Asn Arg Val Arg Gin Gly Tyr Ser Pro Leu 
B5 90 95 

Ser Phe Gin Thr Leu Leu Pro Thr Pro Arg Gly Pro Asp Arg Pro Glu 
100 105 HO 

Gly He Glu Glu Met Gly Gly Glu Arg Asp Arg Asp Arg Sex Thr Arg 
115 120 125 

Leu Val His Gly Phe Leu Ala Leu Phe Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cya Leu Phe Leu Tyr His His Leu Arg Asp Leu Leu Leu lie Val Thr 
145 ISO 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
165 170 175 

Trp Trp Asn Leu Leu Lys Tyr Trp Ser Gin Glu Leu Gin Lys Ser Ala 
1B0 185 190 

Val He Leu Leu Asn Ala Thr Ala He Ala Val Ala Glu Gly Thr Aso 
1SS 200 205 * 

Arg Val Leu Glu Val Leu Gin Arg Ala Tyr Arg Ala He Leu His Ile 
210 215 220 

Pro Arg Arg Ha Arg Gin Gly Leu Glu Met Ala Leu Leu 
225 230 235 
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(2) INFORMATION FOR SEQ ID N0i643i 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 237 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE i protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 43: 

Glu Thr He Trp Asp Asn Hat Thr Trp Mat Gin Trp Glu Arg Glu He 
1 5 10 IS 

Asp Asn Tyr Thr Ass He lie Tyr Thr Leu He Glu Glu Ser Gin Asn 
20 25 30 

Gin Gin Glu Lys Asn Glu Leu Glu Leu Leu Glu Leu Asp Lys Trp Ala 
35 40 AS 

Asn Leu Trp Asn Trp Phe Ser He Ser Aan Trp Leu Trp Tyr He Lys 
50 55 60 

Leu Phe He Met Val Val Gly Gly Leu Val Gly Leu Arg He Val Phe 
€5 70 75 80 

Thr Val Leu Ser He Val Aan Arg Val Arg Gin Gly Tyr Ser Pro Leu 
85 90 95 

Ser Phe Gin Thr Hia Leu Pro Thr Pro Lya Gly Pro Asp Arg Pro Glu 
100 105 110 

Gly He Glu Glu Glu Gly Gly Glu Arg Asp Arg Gly Ser Ser Thr Arg 
115 120 125 

Leu Val Hia Gly Phe Leu Ala Leu. Phe Trp Asp Asp Leu Arg Ser Leu 
130 135 140 

Cya Leu Phe Ser Tyr Hia His Leu Arg Asp Leu Leu Leu He Val Thr 
145 150 155 160 

Arg He Val Glu Leu Leu Gly Arg Arg Gly Trp Glu Ala Leu Lys Tyr 
1(5 170 175 

Trp Trp Aan Leu Leu Gin Tyr Trp Arg Gin Glu Leu Gin Lya Ser Ala 
180 105 ISO 

Val Ser Leu Phe Aan Gly Thr Ala He Ala Val Ala Glu Gly Thr Asp 
195 200 205 

Arg Val He Glu Ala Leu Arg Arg Ala Tyr Arg Ala lie Leu His He 
210 215 220 

Pro Arg Arg He Arg Gin Gly Leu Glu Arg Ala Leu Leu 
225 230 235 
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(2) INFORMATION FOR SEQ ID N0:$44: 

(i) SEQUENCE CH ARACTERISTICS : 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 

CO STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644: 

Met Gly Oly Lye Trp Ser Lye Ser Ser Val lie Qly Trp Pro Ala Val 
1 & 10 15 

Arg Glu Arg Met Arg Arg Ala Glu Pro Ala Ala Asp Gly Val Gly Ala 
20 25 30 

Val Ser Arg Asp Leu Glu Lys His Gly Ala He Thr Ser Ser Asn Thr 
33 40 45 

Ala Ala Asn Asn Ala Ala Cys Ala Trp Leu Glu Ala Gin Glu Glu Glu 
50 55 60 

Glu Val Gly Phe Pro Val Thr Pro Gin Val Pro Leu Arg Pro Met Thr 
63 70 75 80 

Tyr Lys Ala Ala Val Asp Leu Ser His Phe Leu Lys Glu Lys Gly Gly 
B5 90 95 

Leu Glu Gly Leu He His Ser Gin Arg Arg Gin Asp He Leu Asp Leu 
100 105 110 

Trp He Tyr His Thr Gin Gly Tyr Phe Pro Asp Trp Gin Asn Tyr Thr 
115 120 125 

*ro Gly Pro Gly Val Arg Tyr Pro Leu Thr Phe Gly Trp Cys Tyr Lys 
130 135 140 

Leu Val Pro Val Glu Pro Asp Lys val Glu Glu Ala Asn Lys Gly Glu 
"5 150 155 1C0 

Asn Thr Ser Leu Leu His Pro Val Ser Leu His Gly Met Asp Asp pro 
165 170 175 

Glu Arg Glu Val Leu Glu Trp Arg Phe Asp Ser Arg Leu Ala Phe His 
160 185 190 

His Val Ala Arg Glu Leu His Pro Glu Tyr Phe Lys Asn Cys 
195 200 205 



(2) INFORMATION FOR SSQ ID 110:645: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino aoids 
IB) TYPE: amin o acid 

(C) STRANDEDNESS? single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 645: 

Met Gly Gly Lys Ala Lys Ser Ser Val Val Arg His Ala Ser 
1 5 10 is 

Gly Val Gly Ala Asn Asn Gin Cya 

20 25 
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(2) INFORMATION FOR SBQ ID HO: 646: 

(i) sequenc e ch aracteristics t 

(A) LENGTH i 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION t SBQ ID HO: £46: 

Met Gly Gly Lys Trp Ser Glu Ser Ser Val Val Arg Arg His Val Pro 
1 5 10 15 

Leu Arg Gin Gly Ser Tyr Arg Ser Pro Leu 
20 25 



(2) INFORMATION* FOR SBQ ID NO: 647 1 

(i) SEQUENCE CHARACTERISTICS J 

(A) LENGTH t 74 base pairs 

(B) TYPE: nucleic acid 

(C) 8TRANDEDNESS ? Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647: 

CTAGAAGGGA CTTTCCGCTG GGGACTTTCC AGGGftGGCGT GGCCTGGGCG GOACTGGGGA 60 
GTGGCGAGCC CTCA 74 



(2) INFORMATION FOR SEQ ID NO: 648: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 8 : 

CTGTTGGGGA CTTTCCATCC GTTGGGGACT TTCCAAGGCO GCGTGGCCTG GGTGACTAGT 60 
TCCOGTQGGG ACTTTCCA 78 



(2) INFORMATION FOR SEQ ID NO: 64 9 1 

(i) SEQUENCE CH ARACTERISTICS a 

(A) LENGTH: 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS t single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 649: 



CCGTTTGTTC CGTTGGGGAC TTTCCAGGAG ACGTGGCCTG AGTGACTAAG CCGCTQGGGA 
CTTTCCG 



60 
67 
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(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH s 621 base pairs 

(B) TYPE? nucleic acid 

(C) STRANDEDNESS i single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650: 

ATGGGTGGCA AGTGGTCAAA AAGTAGTGT5 AT7GGATGGC CT G CTG TA AG GGAAAGAATO 60 

AGACGAGCTG ASCCAGGAGC AGATGGGGTG GGAGCAGT&T CTCGAGACCT AGAAAAACAT 120 

GGAGCAATCA CAAGTAGCAA TACAGCAGCT AACAATGCTG CTTGTGCCTO GCTAGAAGCA 180 

CAAGAGGAGG AAGAGGTGGG TTTTCCAGTC ACACCTCAGG TACCTTTAAG ACCAATGACT 240 

TACAAGGCAG CTGTAGATCT TAGCCACTTT TTAAAAGAAA AGGGGGGACT GGAAGGGCTA 300 

ATTCACTCCC AAAGAAGACA AGATATCCTT GATCTGTGGA TCTACCACAC ACAAGGCTAC 360 

TTCCCTGATT CGCAGAACTA CACACCAGGG CCAGGGGTCA GATATCCACT GACCTTTGGA 420 

TGGTGCTACA AGCTAGTACC AGTTGAGCCA GAXAAGGTAG AAGAGGCCAA TAAAGGAGAG 480 

AACACCAGCT TGTTACACCC TGTGAGCCTG CATGGAATGG ATGACCCTGA 6AGAGAAGTG 540 

TXAGAGTGGA GGTTTGACAG CCGCCTAGCA TTTCATCACG TGGCCCGAGA GCTGCATCCG 600 

GAGTACTTCA AGAACTGCTG A $21 

(2) INFORMATION FOR SEQ ID NOrCSli 

(i) SEQUENCE CH ARACTERISTICS ; 

(A) LENG TH; 1596 base pairs 

(B) TYPBi nucl eic acid 

(C) STRANDEDNESS; single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ZD NO: 651: 

GAACAGATTT GOAATAACAT GACCTGGATG GAGTGGGACA GAGAAATTAA CAATTACACA 60 

AGCTTAATAC ACTCCTTAAT TGAAGAATCG CAAAACCAGC AAGAAAAGAA TGAACAAOAA 120 

TTATTGGAAT TAGATAAATO GGCAAOTTTG TGGAATTGGT TTAACATAAC AAATTGGCTG 180 

TGGTATATAA AATTATTCAT AATGATAOTA GGAGOCTTO* TAGGTTTAAG AATAGTTTTT 240 

GCTGTACTTT CTATAGTGAA TAGAGTTAGG CAGGGATATT CACCATTATC GTTTCAGACC 300 

CACCTCCCAA TCCCGAGGGG ACCCGACAGG CCCGAAGGAA TAGAAGAAGA AGGTGGAGAG 360 

AGAGACAGAG ACAGATCCAT TCGATTAGTG AACGGATCCT TAGCACTTAT CTGGGACGAT 420 

CTGCGGAGCC TGTGCCTCTT CAGCTACCAC CGCTTGAGAG ACTTACTCTT G A TT GTA ACG 480 

AGGATTGTGG AACTTCTGGG ACGCAOGGGG TGGGAAGCCC TCAAATAITG GTGGAATCTC 540 

CTACAGTATT GGAGTCAGGA ACTAAAGAAT AGTGCTGTTA ACTTGCTCAA TGCCACAGCC 600 

AXAGCAGTAG CTGAGGGGAC AGATAGGGTT ATAGAAGTAT TACAAGCAGC TTATAOAGCT 660 

ATTOGCCACA XACCTAGAAG AATAAGACAG GGCTTGGAAA GGATTTTGCT ATAAGATGGG 720 

TGGCAAGTGG TCAAAAAGTA GTGTGATTGG ATGGCCTBCT GTAAGGGAAA GAATGAGACG 780 
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AGCTGAGCCA GCAGCAGATG OOGTOOOAOC AGTATCTCGA GACCTAGAAA AACATGGAGC 840 

AATCACAAGT AGCAATACAG CAGCTAACAA TGCTGCTTGT GCCTGGCTAG AAGCACAAGA 900 

GGAGGAAGAG GTGGGTTTTC CAGTCACACC TCAGGTACCT T7AAGACCAA TGACTTACAA 960 

GGCAGCTGTA GATCTTA0CC ACTTTTTAAA AGAAAAGGGG GGACTGGAAG GGCTAATTCA 1020 

CTCCCAAAGA AGACAAGATA TCCTTGATCT GTGGATCTAC CACACACAAG GCTACTTCCC 1080 

TGATTGGCAG AACTACACAC CAGGOCGAGQ GGTCAGATAT CCACTGACCT TTGGATGGTG 1140 

CTACAAGCTA GTACCAGTTG AGCCAGATAA GGTAGAAGAG GCCAATAAAG GAGAGAACAC 1200 

CAGCTTGTTA CACCCTGTGA GCCTGCATGG AATGGATGAC CCTGAGAGAG AAGTGTTAGA 1260 

GTGGAGGTTT GACAGCCGCC TAGCATTTCA TCACGTGGCC CGAGAGCTGC ATCOGGAGTA 1320 

CTTCAAGAAC TGCTGACATC GAGCTTGCTA CAAGGGACTT TCCGCTGGGG ACTTTCCAGG 1380 

GAGGCGTGGC CTGGGCGGGA CTGGGGAGTG GCGAGCCCTC AGATGCTGCA TATAAGCAGC 1440 

TGCTTTTTGC CTGTACTGGG TCTCTCTGGT TAGACCAGAT CTGAGCCTGG GAGCTCTCTG 1500 

GCTAACTAGG GAACCCACTG CTTAAGCCTC AATAAAGCTT GCCTTGAGTG CTTCAAGTAG 1560 

TGTGTGCCCG TCTQ TTO I W GACTCTGGTA ACTAGA 1596 
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(2) INFORMATION FOR SEQ ID N0352: 

O SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: Qoetr 

(3) MOLECULE TYPE: UNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:6$2: 

CCTTTTTGCC 

(2) DEFORMATION FOR SEQ ID NO*53: 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH lObaseprirs 
(B) TYPE: oodeic acid 
(Q STRANDEDNESS: single 

(D) TOPOLOGY: finest 

(u) MOLECULE TYPE: DMA 

<xQ SEQUENCE DESCRIPTION: SEQ IDNOrfft: 

cmriuuiT 

(2) INFORMATION FOR SEQ ID NO*54: 

0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: tmekic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fine* 

(19 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0£54: 
TTTTTGOCTO 

(2) INFORMATION FOR SEQ ID N0*55: 

© SEQUENCE CHARACTERBTICS: 

(A) LENGTH: 10 base pairs 

(B) TYFE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: UNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0455: 
11 1 1GCCTGT 

(Z) INFORMATION FOR SEQ IDN0^56: 

0) SEQUENCE CHARACTERBTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: xmricTC add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: Qnear 



(5) MOLECULE TYPE DNA 
Cxi) SEQUENCE DESCRIPTION: SEQ IDNO:656: 
TTTGCCTGTA 

(2) INFORMATION FOR SEQ ID NO*57: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nacfeic add 

(Q STRANDEDNESS: tingle 
<D) TOPOLOGY: Sneer 

©MOLECULE TYPE* DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:657: 

TTGCCTCTAC 

(2) INFORMATION FOR SEQ ID NO&S8: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
(H) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(S) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO$5S: 
TGCCTQTACT 

(2) INFORMATION FOR SEQ 0>NO:659: 

(2) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE' nuefcic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: I 



00 MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQ ID NO*5* 
GGCTGTACTG 

(2) INFORMATION FOR SEQ IDNO360: 

O SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: mzdeicadd 

(CD STRANDEDNESS: single 
(D) TOPOLOGY: iatear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NOtftfO: 

GCTGTACFGG 
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(2) INFORMATION FOR SEQ ID NO:661: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

GO MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:661: 

CTGTACTGGG 

<2) INFORMATION FOR SEQ ID N0662: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: oncleic acid 

(C) S7RANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(>d> SEQUENCE DESCRIPTION: SEQ n>NO*62: 
TGTACTGGGT 

(2) INFORMATION FOR SEQ ID NO*63 : 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nuddcacid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fu) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ lDNO:663: 

GTACFGGGTC 

(2) INFORMATION FOR SEQ ID NO£64: 

(I) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 10 base pair* 

(B) TYPE: nucleic add 

CQ STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(U) MOLECULE TYPE: DNA 

W SEQUENCE DESCRIPTION: SEQ ID NOrf64: 

TAOGGGTCT 

0) INFORMATION FOR SEQ ID.NOrfft: 

© SEQUENCE CHARACTERISTICS: 
CA) LENGIH: 10 base pass 
(P)TYPE: nucleic add 

(C) STRANDEDNESS: angle 
(P) TOPOLOGY: linear 
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(5) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO*65: 
ACTOGGTCTC * 

(2) INFORMATION FOR SEQ ID NO:666: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pafas 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 1 



(ii) MOLECULE TYPE: DNA 
(xj) SEQUENCE DESCRIPTION: SEQ ID NO:666: 
CTGGGTCTCT 

(2) INFORMATION FOR SEQ IDNO:667: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGIH 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(2) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:667: 
TGGGTCTCTC 

(2) INFORMATION FOR SEQ ID NO*68: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base prim 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(H> MOLECULE TYPE: DNA 

(x0 SEQUENCE DESCRIPTION: SEQ ID NO:668: 

GGGItriCTCT 

(2) INFORMATION FOR SEQ ID Nttftfc 

(i) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pairs 
(B) TYPE: nudcic acid 
(O STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ Q>N0^69: 
GGTCTCTCTG 

p) INFORMATION FOR SEQ ID NO*70: 



WO 95121912 



PCT/AU95/00063 



166- 



(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH 10 base pan* 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE DKA 
(xD SEQUENCE DESCRIPTION: SEQ ID NOrfKh 
GTCTCTCTGG 

(2) INFORMATION FOR SEQ ID NO*7I: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: ID base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: fincar 

(0) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNOrf71: 

tciuiutggt 

O) INFORMATION FOR SEQ IDNOrfTl: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic sod 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:672: 
CTCTCTOGflT 

(2) INFORMATION FOR SEQ IDNO&73: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

01) MOLECULE TYPE DNA 

<ri) SEQUENCE DESCRIPTION: SEQ IDNOS73: 
TCICTOGTTA 

(2) INFORMATION FOR SEQ ID NO£74: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic add 

<C) STRANDEDNESS: sing{e 
03) TOPOLOGY: Sneer 

(H) MOLECULE TYPE DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:674: 
L'lClfJOUAG 

(2) INFORMATION FOR SEQ ZD NO*75: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nodefc add 

(C) STRANDEDNESS: single 
CP) TOPOLOGY: linear 

<fi) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO£75: 

iClClOGflA 

(2) INFORMATION FOR SEQ ID NOtf76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE sndeic acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: fincar 

(H) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO&76: 
CTGGTTAGAC 

(2) INFORMATION FOR SEQ ID NO#77: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENOTBt 10 base pairs 

(B) TYPE nucleic add 

(Q STRANDEDNESS: single 
0>) TOPOLOGY: fincar 

(D) MOLECULE TYPE DNA 

(xi) SEQUBf GE DESCRIPTION: SEQ ID NOtf77: 

TGGTTAGAOC 

0) INFORMATION FOR SEQ IDNOtfTI: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO£73: 
GGTTAGACCA 

(2) INFORMATION FOR SEQ ID NOrfT* 
(0 SEQUENCE CHARACTERISTICS: 
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(A) LENCjTH: 10 base pain 

(B) TYPE: nucleic acid 

{Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

fn) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SBQ ID NO:67* 

GTTAGACCAG 



9) INFORMATION FOR SBQ ID NO:680: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bast ptfas 

(B) TYPE: nnckio icid 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SBQ ID NOtfSO: 

TTAGACCAGA 



(2) INFORMATION FOR SEQ ID NO:6Sl: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 hue pain 

(B) TYPE: nucWo add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Uncar 

(it) MOLECULE TYPE: DNA 

(kO SEQUENCE DESCRIPTION: SEQ ID NOS81: 

TAOACCAGAT 



(2) INFORMATION FOR SEQ ID NCfc68fc 

0) SEQUENCE CHARACTERISTICS: 

(A) LEN0IH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Uncar 

00 MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO£S2: 
AGAOCAGATC 



(2) INFORMATION FOR SEQ ID NOtf83: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYFE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SEQ ID NO:6B3: 
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GACCAGATCT 



(2) INFORMATION FOR SEQ ID NO.-684: 

(0 SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pairs 
09) TYPE: nucleic acid 
(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(3d) SEQUENCE DESCRIPTION: SEQ IDNO:684: 

ACCAGATCTO 



(2> INFORMATION FOR SEQ ID NOrfU: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Smear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO:685: 

CCAGATCTGA 



<2) INFORMATION FOR SEQ ID NOtf86: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

CxQ SEQUENCE DESCRIPTION: SEQ ID NO*&6: 

CAOATCTOAG 



(2) INFORMATION FOR SEQ ID NO:6S7: 

0) SEQUENCE CHARACTERISTICS: 
W LENGTH: 10 base pairs 
(B)TYPE: nucleic add 
(Q STRANDEDNESS: single 
CD) TOPOLOGY: lfaiear 

00 MOLECULE TYPE: UNA 

CxO SEQUENCE DESCRIPTION: SEQ ID NO:687: 

AGATCTGAGC 



m INFORMATION FOR SEQ ID NO*S8: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: ujddo add 
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(Q STRANDEDNESS: singlfl 
CP) TOPOLOGY: Encar 

(B) MOLE&JLE TYPE: DNA 

(xO SEQUENCE DESCRIPTION: SEQ ID Nfr&SS: 

GATCTGAGCC 



(2) INFORMATION FDR SEQ ID NOrfg* 

(9 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: IObascptin 
OB) TYPE: mulcts idd 
(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

<ri) SEQUENCE DESCRIPTION: SEQ IDNOtfS* 

ATCTGAGCCT 



(2) INFORMATION FOR SEQ ID NOtfKh 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 10 base pairs 

(B) TYPE: miriete add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 

(xj) SEQUENCE DESCRIPTION: SEQ ID NOrf90: 

TCTGAGCCTO 



(2) INFORMATION FOR SBQ ID NO*91: 

(I) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 hue pain 
(B) TYPE tmdeic acid 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

<H) MOLECULE TYPE: DNA 

W SEQUENCE DESOUPTTON: SEQ IDNCfc691: 

CIGAGCCTGO 



(2) INFORMATION FOR SEQ ED NO£92 : 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: wnrfrf» add 

(C) STRANDEDNESS: rfntfe 

(D) TOPOLOGY; linear 

(H) MOLECULE TYPE: DNA 
(ri) SEQUQfCE DESCRIPTION: SEQ ID NO&92: 
TGAGOCTGGG 
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{2) INFORMATION FOR SEQ ID NO*93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nocistcadd 

(P> STRANDEDNESS: single 
(D) TOPOLOGY: Hacar 

fd) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NOrfKJ: 

GAGOCTGGGA 



(2) INFORMATION FOR SEQ ID NO: W: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ IDN0394: 

AGCCTGGGAG 



(2) INFORMATION FOR SEQ ID N0495: 

©SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

(H) MOLECULE TYPE: DNA 
(Jd) SEQUENCE DESCRIFTTON: SEQ ID N0695: 
GCCTGGGAGC 



(2) INFORMATION FOR SEQ ID NO£K: 

Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mxddc add 

(Q STRANDEDNESS: tingle 
(TflTOi*UXX?Y: faetr 

(0) MOLECULE TYPE: DNA 

(ri) SEQUENCE DESCRIPTION: SEQ ID NO*96: 

CCTOGGAGCT 



O) INFORMATION FOR SEQ ID N0^7: 

ffl SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pate 

(B) TYPE: nodefe add 

<C) STRANDEDNESS: datfe 
(D) TOPOLOGY: linear 
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(hi MOLECULE TYPE: UNA 
(xi) SEQUENCE DESCRIFnON; SEQ ID NOtf97: 
CTGGGAGCTC 

(2) INFORMATION FOR SEQ ID N0.69& 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: ] 



CO MOLECULE TYPE: UNA 
(xi) SEQUENCE DESCRIPTION: SEQIDNO:69S: 
TOGGAGCTCT 

(2) INFORMATION FOR SEQ ID ND:«9: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Ibcar 

(u) MOLECULE TYPE: DNA 

<xj) SEQUENCE DESCRIPTOR SEQ ID NOS99: 

GGGAGCICIC 

(2) INFORMATION FOR SEQ ID NO:700: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bin pun 

(B) TYPE; nuddo add 
(Q STRANDEDNESS: 
(D) TOPOLOGY: 



(n) MOLECULE TYPE: UNA 
(xi) SEQUENCE DESCRIPnON: SEQ ID NO:700: 
GGAGCTCtCT 

(2) INFORMATION FOR SEQ ID NOeTOi: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bus pahs 

(B) TYPE; nucleic add 

(Q STRANDEDNESS: tingle 
(D> TOPOLOGY; linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:701: 

GAGCTUTCTO 

(2) INFORMATION FOR SEQ ID NO302: 



(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* nucleic acid 

(Q) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(19 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO702: 

AGCIUTCTCG 

(2) INFORMATION FOR SEQ ID NO:703: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nodex add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(U) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:703: 

GCICTCTCGC 

(2) INFORMATION FOR SEQ ID NO;704: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base palm 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

Gi) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:704; 

CTCT CT OOCT 

(2) INFORMATION FOR SEQ IDNOtfOS: 

Q) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
O) TYPE: imckk acid 

(C) STRANDEDNESS: sfngte 

(D) TOPOLOGY: linear 

(10 MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQ IDNO:705: 
TCTCTGGCEA 

(2) INFORMATION FOR SEQ IDNO:706: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate palm 

(B) TYPE: nucleic add 

(Q STRANDEDNE&: tingle 
(D) TOPOLOGY: Bnear 

(U) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ IDNO:706: 
CTCTGGCTAA 



(2) INFORMATION FOR SEQ ID NO;707: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 boss pun 

(B) TYPE: fiudete add 

(C) STRANDEDNESS: tingle 
P) TOPOLOGY: linear 

(u) MOLECULE TYPE; UNA 

Od) SEQUENCE DESCRIPTION: SEQ IDNOtf07: 

TCTGGCTAAC 



(2) INFORMATION FOR SEQ ID KO^OS: 

© SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 boa pairs 
(H) TYPE: ntfikfe sad 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE: UNA 
(id) SEQUENCE DESCRIPTION; SEQ IDNO:7Q8: 
CTGGCTAACT 



(2) INFORMATION FOR SEQ ID NO:70fc 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObasepiifs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(fi) MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQ ID NO:709: 
TGGCTAACTA 



©INFORMATION FOR SEQ IDNO:710: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: xmdeic add 
STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 
(»> SEQUENCE DESCRIPTION: SEQ EDNO:710: 
GGCTAACTAG 



(2) INFORMATION FOR SEQ ID NO:7U: 
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(A) LENGTH: 10 base pain 

(B) TYPE: nucleic scld 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(u) MOLECULE TYPE: DNA 
(Jd) SEQUENCE DESCRIPTION: SEQ ID NO:711: 
GCTAACTAGG 



(2) INFORMATION FOR SEQ ID NO:712: 

CD SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 

<xi) SEQUENCE DESCRIPTION: SEQ E>NO:7l2: 

CTAACTAGGG 



(2) INFORMATION FOR SEQ ID NO:713: 

G) SEQUENCE CHARACTERISTICS: 

(A) LENGTH lObisepaiis 

(B) TYPE: nucleic add 

(C) STRANDEDNSS; single 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 
Cd) SEQUENCE DESCRIPTION: SEQ ID NO:713: 
TAACtAGGGA 



(2) INFORMATION FOR SEQ ID NO:714: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pate 

(B) TYPE: rodeic add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

(5) MOLpCULE TYPE DNA 

0d) SEQUBTCE DESCRIPTION: SEQ ID NO:714: 

AACTAGGGAA 



(2) INFORMATION FOR SEQ ID NOf7l5: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: suddc add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(S) MOLECULE TYPE: DNA 



©SEQUENCE CHARACTERISTICS: 



fri) SEQUENCE DESCRIPTION: SEQ ID NO*7 L5 : 
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ACTACGOAAC 

(2) INFORMATION FOR SEQ ID N0^716: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: tingje 

(D) TOPOLOGY: linear 

09 MOLECULE TYPE- UNA 
(xi) SEQUENCE DESCRIPTION: SEQ H>NO:716: 
CTAGGGAACC 

Q) INFORMATION FOR SEQ ID NO:717: 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: I 



(B) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCRIPTION: SEQIDNO:717: 
TAGOGAAOOC 

(2) INFORMATION FOR SEQ ID NO:718: 

CD SEQUENCE CHARACTERISTICS: 

(A) LD^GTH: 10 hue pahs 

(B) TYPE' anddo arid 

(C5 STRANDEDNESS: tingle 
<P) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SEQ ID N0718: 

AGOGAACCCA 

(2) INFORMATION FOR SEQ ID NO:719: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lOhttsepabs 

(B) TYPE: nadek add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(B) MOLECULE TYPE: DNA 
0d) SEQUENCE DESCRIPTION: SEQ ID NO:719*. 
GGGAACCCAC 

CZ) INFORMATION FOR SEQ IDNO:72D: 

(!) SEQUDJCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: oucteio add 



(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO:720: 

GGAACCCACT 

(2) INFORMATION FOR SEQ ID NO:721: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pah* 

(B) TY^E nucleic arid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: 1 



G9 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ tt>NO:721: 
GAACCCACTG 

(2) INFORMATION FOR SEQ ID NO:722: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pate 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Hnear 

00 MOLECULE TYPE DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:722: 
AACCCACTGC 

(2) INFORMATION FOR SEQ ID NQ523: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDH2NESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:723 : 

ACCCACTOCT 

(2) INFORMATION FOR SEQ ID ND^24: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pahs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:724: 

CCCACTGCTT 
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(2) INFORMATION FOR SBQ IDNO:725: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObtse pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

Cxi) SEQUENCE DESCRIPTION: SEQ ID 1*0:723: 

CCACTGCTTA 

(2) INFORMATION FOR SEQ ID NO:726: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: rmdeifi acid 

<Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

CO MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SEQ ID NO:726: 

CACTOC7TAA 

(2) INFORMATION FOR SEQ ID NO^/27: 

(I) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 10 base pahs 

(B) TYPE: tmckfc arid 

CO STRANDEDNESS: single 
(P) TOPOLOGY: 



QS) MOLECULE TYPE: DNA 
fri) SEQUENCE DESCRIPTION: SEQ ID NO:727: 
ACTOCTTAAG 

C2) INFORMATION FOR SEQ ID NOtf2fc 

Q SEQUENCE CHARACTERISTICS: 
(A) LENGTH 10 base pain 
(0) TYPE: nucleic add 

(C) STRANDHUNHSS: stagfe 

(D) TOPOLOGY: Cnear 

00 MOLECULE TYPE: DNA 
£d) SEQUENCE DESCRIPTION: SEQIDNO:728: 
CTGCTTAAGC 

(2) INFORMATION FOR SEQ ID NO:729: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: imdete add 

• (O STRANDEDNESS: single 
(D) TOPCttjOGY: ifxiear 
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fri) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:729; 
TGCTTAAGOC 

C2) INFORMATION FOR SEQ ID NO:730; 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGIH: 10 base pain 

(B) TYPE: noclefe add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(U) MOLECULE TYPE: DNA 
Od) SEQUENCE DESCRIPTION: SEQ ID NO:730: 
GCITAAGCCT 

(2) INFORMATION FOR SEQ ID NO:731: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: noddoadd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Gnear 

<u) MOLECULE TYPE: DNA 
<m) SEQUENCE DESCRIPTION: SEQJJ>NO:73l: 
CTTAAGCCIC 

(2) INFORMATION FOR SEQ ID NO:73i 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS; single 
CD) TOPOLOGY: linear 

(u) MOLECULE TVTE: DNA 

(zJ) SEQUENCE DESCRIPTION: SBQ ID NO:732: 

TTAAGCCTCA 

(2) INFORMATION FOR SEQ ID NOtf33: 

(5) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pass 
<B) TYPE fiadeb add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<B) MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID N0333: 
TAAGOCTCAA 

(2) INFORMATION FOR SEQ ID N0334: 
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0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE mickic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Unea/ 

(0) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:734: 
AAGCCTCAAT 

C2) INFORMATION FOR SEQ ID NO:735: 

CO SEQUENCE CHARACTERISTICS: 
(A) LENGIH: 10 base pans 
OB) TYPE* nucleic acid 
(© STRANDEDNESS: tingle 
(D) TOPOLOGY: 1 



(ii) MOLECULE TYPE: DNA 
(xD SEQUENCE DESCRIPTION: SEQIDN0335: 
AGOCTCAATA 

(2) INFORMATION FOR SEQ IDNCfcTM: 

<9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

<Q STRANDEDNESS: single 
(D) TOPOLOGY: 1 



09 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESOUPTION: SEQIDNO:736: 
GCCTCAATAA 

(2) INFORMATION FOR SEQ IDNO:737: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pals 
<B) TYPE: tmdale acid 
(C) STRANDEDNESS: single 
<P) TOPOLOGY: 



00 MOLECULE TYPE: DNA 
Cd) SEQUENCE DESCRIPTION: SEQ ID NO:737: 
CCTCAATAAA 

P) INFORMATION FOR SEQ ID NO:738: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGftH: 10 base pairs 

(B) TYPE: nucleic acid 

CO STRAN DEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: UNA 



(xf) SEQUENCE DESCRIPTION: SEQ ID NO:738: 
CTCAATAAAG 

(2) INFORMATION FOR SEQ ID NO:739: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pans 

(B) TYPE: mickic acid 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: 1 



09 MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO-.739: 
TCAATAAAGC 

(2) INFORMATION FOR SEQ ID NO.740: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pass 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: ilngle 

(D) TOPOLOGY: 



09 MOLECULE TYPE: DNA 
(*Q SEQUENCE DESCRIPTION: SEQ ID NO:740: 
CAATAAAGCT 

(2) INFORMATION FOR SEQ ID NO:74l: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic acid 

(Q STRANDEDNESS: stogie 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ IDNO.741: 

AATAAAGCTT 

(2) INFORMATION FOR SEQ ID NO:742: 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pain 
O) TYPE: nodefc add 
(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRBTTON: SEQ IDN0342: 

ATAAAGCITO 

(2) INFORMATION FOR SEQ ID N0443: 
0) SEQUENCE CHARACTERISTICS: 
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<A) LENGTH: lObsscpaia 
<B) TYPE: nucleic scad 
(Q STRANDEDNESS: angle 
(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:743: 

TAAAGCITCC 

(2) INFORMATION FOR SEQ ID NO:744: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObuepaia 

(B) TYPE: node* kM 

(Q STRANDEDNESS: tingle 
(D) TOPOLOGY; linear 

00 MOLECULE TYPE: DMA 

(rf) SEQUENCE DESCRIPTION: SEQ ID NO:744: 

AAAGCTTOCC 

(2) INFORMATION FOR SEQ ID NO:745: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic mad 

(C) STRANDEDNESS: single 
<D> TOPOLOOYi liner 

(U) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SBQIDNOff45: 

AAGCTTOCCT 

©INFORMATION FOR SEQ ID ND^4d: 

® SEQUENCE CHARACIERKTICS: 

(A) LENGTH: 10 bass {Bits 

(B) TYPE: nuddeacid 

(C) STRANDEDNESS: itngtc 

(D) TOPOLOGY: lin 



Cu) MOLECULE TYPE* DNA 
(*0 SEQUENCE DESCRIPTION: SEQ ID NO:74& 
ACCITOCCTT 

(2) INFORMATION FOR SEQ ID NO-747: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: cuckfc tetd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Qnr 

0D MOLECULE TYPE: DMA 

0t9 SEQUENCE DESCRIPTION; SEQ ID NO-W: 
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QCTTGCCTTG 

C2) INFORMATION FOR S&} ID NO:748: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE* truckle add 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

©MOLECULE TYPE: DNA 

W SEQUENCE DESCRIPTION: SEQ ID NO:748: 

CTTGCCTTGA 

(2) INFORMATION FOR SEQ ID NO:749: 

fl) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: midefc add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: Anew 

(5) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NO:749: 

TTGCCTTCAG 

(2) INFORMATION FOR SEQ ID NO:750: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

©MOLECULE TYPE DNA 

(jd) SEQUENCE DESCRIPTION: SEQ ID ND:750: 

TOCCITGAGT 

(2) INFORMATION FOR SEQ ID NCh751: 

© SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bae pairs 

(B) TYPE* nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE- DNA 
(w) SEQUENCE DESCRIPTION: SEQ ID ND-75 1: 
GCCTTGAGTC 

0) INFORMATION FOR SEQ ID NO:752: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base nans 

(B) TYPE: imdckadxl 
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(Q STRANDEDNESS: single 
(D) TOPOLOGY; linear 

(U) MOLECULE TYPE DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:752: 

CCTTGA0TGC 

(2) INFORMATION FOR SEQ ID NO:7S3: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQIDNO:753: 

CTTQAOTGCT 

(2) INFORMATION FOR SEQ ID NO.-754; 

<D SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucteifi add 

(O STRANDEDNESS: tingle 
<D) TOPOLOGY: linear 

(2) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ IDNO:754: 

TTCAGTCCTT 

(2) WH3RMATION FOR SEQ ID N0555: 

CD SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 bue pahs 
<B) TYPE: nodefc tdd 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

01) MOLECULE TYPE: DNA 
(xl) SEQUENCE DESCSIPTTON: SEQ ID NQ^755: 
TOAOTGCnC 

C2) INFORMATION FOR SEQ ID NO:756: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pales 

(B) TYPE: crackle add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Hnesr 

00 MOLECULE TYPE: DNA 
(ri) SEQUENCE DESCRIPTION: SEQ IDNO:756: 
GAGTOCTICA 



<2) INFORMATION FOR SEQ ID NO:757: 

® SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 10 base pans 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: finear 

(il) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:757: 
AGTGCTTCAA 

(2) INFORMATION FOR SEQ ID NOr758 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: Duddo acid 

(O STRANDEDNESS: single 
CP) TOPOLOGY: linear 

(U) MOLECULE TYPE: BNA 

(aQ SEQUENCE DESCRIPTION: SEQ ID 140:758: 

GTOCITCAAO 

(2) INFORMATION FOR SEQ IDNO;759: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE- tsndeic add 

(O STRANDEDNESS: single 
(P) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SBQ ID NO-.759: 

TGCITCAAGT 

C2) INFORMATION FOR SEQ ID NO:76fc 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10 base pans 
'(B) TYPE: nudefe add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(B MOLECULE TYPE: DNA . 
(xQ SEQUENCE DESCRIPTION: SBQ ID NO:760: 
GCTTCAAOTA 

(2) INFORMATION FOR SEQ ID NOTO 1 : 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pan 

(B) TYPE: mtdeie tdd 

(Q STRANDEDNESS: single 
<P) TOPOLOGY: (bear 
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(u) MOLECULE TYPE: UNA 
(ri) SEQUENCE DESCRIPTION: SEQ ID NO:761: 
CTTCAAGTAG 

(2) INFORMATION FOR SEQ ID NO:7fi2: 

(D SEQUENCE CHARACTERISTICS; 

(A) LENGTH; 10 tec pain 

(B) TYPE: Buckie add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: Bccar 

(0) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQJDNO:7fi2: 

TTCAAGTAGT 

(2) INFORMATION FOR SEQ ID ND:7*3: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bin pans 

(B) TYPE: nucteie add 

(O STRANDEDNESS: single 
(D) TOPOLOGY: linear 

05 MOLECULE TYPE: DNA 

(«D SEQUENCE DESCRIPTION: SEQ ID NOTO : 

TCAAGTAGTO 

0) INFORMATION FOR SEQlDNO:764r 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pahs 

(B) TYPE* fiockic add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: BNA 
W SBQUQICE DESCRIPTION: SEQ EDNO:764: 
CAAGTAGTCT 

(2) INFORMATION FOR SEQ ID NOTO5 : 

0) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: lObasepabs 
03) TYPE: nucleic acid 
(€) STRANDEDNESS: stogie 
(D) TOPOLOGY: linear 

00 MOLECWJB TYPE: CWA 

£d) SEQUENCE DESCRIPTION: SEQ ID NO:765: 

AAGTAGTOIXj 

Q) INFORMATION FOR SEQ ID NOTO& 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: mttleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY; linear 

(ti) MOLECULE TYPE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID NO:765: 
AGTAGTGTUT 

(2) INFORMATION FOR SEQ ID NO:767: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



(uf MOLECULE TYPE: DNA 
Od) SEQUENCE DESCRIPTION: SEQ IDNO:767: 
GTAGTGTGTG 

(2) INFORMATION FOR SEQIDNO:76l: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH- 10 base pairs 

(B) TYPE- nucWo add 

(Q STRANDEDNESS: single 
(D) TOPOLOGY: 



(u) MOLECULE TYPE: DNA 
(w) SEQUENCE DESCRIPTION: SEQ ID NO:768: 
TAOTOTGTCC 

(9 INFORMATION FOR SEQ FDN0369: 

(9 SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 10 base pain 
(B) TYPE: mjdric add 
(Q STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(H) MOLEtULE TYPE: DNA 

W SEQUENCE DESCRIPTION: SEQ ID ND:769: 

AGTG1G1UCC 

(2) INFORMATION FOR SEQ ID NOCT0: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 be* pain 

(B) TYPE: naddc add 

(O STRANDEDNESS: single 
(P) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 
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(xl) SEQUENCE DESCRIPTION: SEQ ID NO;770; 
GTGTGTGCCC 

(2) INFORMATION FOR SEQ ID NCfc771: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: lin 



(0) MOLECULE TYPE: DNA 
(w) SEQUENCE DESCRIPTION: SEQ ID NO:771: 
TGTGTCCCCG 

(2) INFORMATION FOR SEQ ID NOT772: 

Q) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: micble add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

(B> MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID M*772: 
OTOTGCCCGT 

(2) INFORMATION FOR SBQ ID NO:773: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nocUte acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: B&eir 

(D) MOLECULE TYPE: DNA 
W SEQUENCE DESCRIPTION: SEQ ID NOc773: 
TCTGCCCGTC 

CO INFORMATION FOR SEQ ID NOt774: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 htK pain 

(B) TYPE: unddc acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(H) MOLECULE TYPE: DNA 

(n) SEQUENCE DESCRIPTION: SEQ tt>NO:774: 

OTCCccortrr 

G) INFORMATION FOR SEQD>NO:775: 

(0 SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(O STRANDEDNESS: single 
CD) TOPOLOGY: ttnesr 

(9) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:775: 

TGCCOGTCTG 

(2) INFORMATION FOR SEQ ID N0.776: 

CO SEQUENCE CHARACTERISTICS: 
(A) LENGTH: IQ base pans 
CB) TYPE: nucleic add 
CP) STRANDEDNESS: llngle 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: SEQ ID NO:776: 

GCCCGTCTGT 

C2> INFORMATION FOR SEQ ID I«k777: 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TITO: nucleic add 

CQ STRANDEDNESS: single 
CP) TOPOLOGY: Docar , 

(ill MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQIDN0377: 

CCCGICIOTT 

(2) INFORMATION FOR SBQ ID J**77S: 

(I) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: midtlc add 

(C) STRANDEDNESS: singlo 
(P) TOPOLOGY: linear 

(QMOCECULE TYPE: DNA 

fxQ SEQUENCE DESCRIPTION: SEQ EDNO-778: 

COGTCTOTTO 

(2) INFORMATION FOR SEQ IDNCK779: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nuckie add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:779: 
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O GTCT GnO T 

(2) INFORMATION FOR SEQ ID NO780: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: imcfeic add 

(Q STRANDEDNESS: single 
<E) TOPOLOGY: linear 

(5) MOLECULE TYPE: DNA 

(*9 SEQUENCE DESCRIPTION: SEQ ID NO.780: 

GTCIXillUlU 

(2) INFORMATION FOR SEQ ID NOTOl: 

(5) SEQUENCE CHARACIERISTTCS: 
(A) LENfJIFL 10 base pairs 
<B) TYPE: oodek acid 
(O STRANDEDNESS: tingle 
<P) TOPOLOGY: I 



(S) MOLECULE TYPE: DNA 
M SEQUENCE DESCRIPTION: SEQ ID NO:781: 
T C lbllblU I 

(2) INFORMATION FOR SEQ ID NO:782: 

0) SEQUENCE CHARACIERlSnCS: 

(A) LENGTH: 10 base pairs 

(B) TYPE; nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: Bnear 

09 MOLECULE TYPE: DNA 
£□) SEQUENCE DESCRIPTION: SEQ IDNO:782: 
ClGlIblUTO 

CO INFORMATION FOR SEQ ID ND:7S3: 

<9 SBQUENCE CHARACTERISTICS: 

(A) UNCjIR: 10 base pahs 

(B) TYPE: aaxfcte add 

(C) STRANLLuNESS: single 

(D) TOPOLOGY: i 



09 MOLECULE TYPE: DNA 

fri) SEQUENCE DESCRIPTION: SEQ ID NC*783: 

TcnrcroTQA 

O) INFORMATION FOR SEQ ID NO:7W: 

CD SEQUENCE CHARACTERISTICS: 
CA) LENG TH: 10 base pain 
(B) TYPE: aockic add 



(C) STRANDEDNESS: single 

(D) TOPOLOGY: faear 

(u) MOLECULE TYRE: DNA 
(xQ SEQUENCE DESCRIPTION: SEQ ID 
GTTQTGTGAC 

CZ) INFORMATION FOR SEQ ID NO:7B5: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH lObasepaiii 

(B) TYPE: nodeic add 

(C) STRANDEDNESS: ifaigle 

(D) TOPOLOGY: V 



(H) MOLECULE TYPE: DNA 
(Kf) SEQUENCE DESCRIPTION: SEQ ID NO:785: 
TTGTOTQACr 

(2) INFORMATION FOR SEQ ID NOrtStf: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic add 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: 



09 MOLECULE TYPE: DNA 
(x9 SEQUENCE DESC3UPTTON: SEQ ID NOT786: 
TtjTGTGACTC 

(Z> INFORMATION FOR SEQ ID N0387: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: flockkadd 

<Q STRANDEDNESS: single 
03) TOPOLOGY: 



(B) MOLECULE TYPE: DNA 
{*9 SEQUENCE DESCRIPTION; SEQ ID NO:787: 
GTGTOACTCT 

(2) INFORMATION FOR SEQ ID NQ7K: 

(9 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: DDctdc add 

{Q STRANDEDNESS: single 
(D) TOPOLOGY; fincar 

(G) MOLECULE TYPE- DNA 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:78S: 

TGTOTGACTC 
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(2) INFORMATION FOR SEQ ID NO:78fc 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: nocktc acid 

(C) STRANDEDNESS: liDglc 

(D) TOPOLOGY: linear 

(U) MOLECULE TYPE DNA 
(xl) SEQUENCE DESCRIPIION: SEQ ID NO:789: 
0TGTGACTCT 

(2) INFORMATION FOR SEQ IDNCfcTOO: 

0) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 10 base pain 
TO TYPE: nucleic add 
<C) STRANDEDNESS: I ingle 
(D) TOPOLOGY: linear 

01) MOLECULE TYPE: DNA 

(jd) SEQUENCE DESCRIPTION: SEQ ID NOr790: 
TQTGACTCTG 

(2) INFORMATION FOR SEQ IDNOTOl: 

<0 SEQUENCE OURACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: Pucjcfa add 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: linear 

00 MOLECULE TYPE: DNA 
(kQ SEQUENCE DESCRIPTION: SEQ ID NO:791: 
GTCACTCTOG 

(2) INFORMATION FOR SEQ ID NCH 92: 

CD SEQUENCE CHARACTERISTICS: 

(A) LENGTH: lObaaeptia 

(B) TYPE: nucleic add 

<0 STRANDEDNESS: itagk 
• (D) TOPOLOGY: linear 

09 MOLECULE TYPE: DNA 

CxJ) SEQUENCE DESCRIPTION: SEQ ID NO:792: 

TGACTCTGGT 

(Z) INFORMATION FOR SEQ ID N0393: 

(Q SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 bate pain 

(B) TYPE: nnddc add 

(Q STRANDEDMI3S: single 
(D)TOP<XjOGY: linear 



00 MOLECULE TYPE: DNA 
(x0 SEQUENCE DESCRIPTION: SEQ ID NO:793: 
GACTCTGGTA 

(2) INFORMATION FOR SEQ ID M*W: 

(I) SEQUENCE CHARACTERISTICS: 

(A) IENGIH: 10 base pain 

(B) TYPE* nnddc add 

(Q STRANDEDNESS: tingle 
(D)TOPOLOOY: 



(Jo) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ IDNO:794: 
ACFCTOGTAA 

(2) INFORMATION FOR SEQ ID NO:795: 

(0 SEQUENCE CHARACTERISnCS: 
CA) LENGTH: lObasepsta 
09) TYPE: fluddc add 
(Q STRANDEDNESS: tingle 
(D) TOPOLOGY: linear 

OQ MOLECULE TYPE: DNA 

(xJ) SEQUENCE DESCRIPTION: SEQ ID NO:795: 

CTCTGGTAAC 

(2) INFORMATION FOR SEQ ID NO:796: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pain 

(B) TYPE: rotddoadd 

(Q STRANDEDNESS: stogie 
(D) TOPOLOGY: Unear 

(U) MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NO:796: 

TCIGG'LAACT 

(2) INFORMATION FORSEQ IDN0797: 

0) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: lObaseprin 
O) TYPE: nnddc arid 

(C) STRANDEDNESS: single 
O^ TOPOLOGY: finear 

0Q MOLECULE TYPE: DNA 

(d) SEQUENCE DESCRIPTION: SEQ ID NO:797: 

CIGGTAACIA 
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(2) INFORMATION FOR SEQ ID W*798: 

(0 SEQUENCE CHARACTERISTICS: 

(A) LENGTH I0b»epans 

(B) TYPE: nucleic tad 

(C) STKANDEDNESS: single 

(D) TOPOLOGY: linttr 

(H) MOLECULE TYPE: DMA 
(ri) SEQUENCE DESCRIPTION: SEQ ID NO:798: 
TGGTAACTAG 



(2) INFORMATION FOR SEQ ID NO:799: 

0) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: I0t*$oj*ia 

(B) TYPE- nodete add 

(O STRANDEDNESS: tingle 
(D) TOPOLOGY: Hour 

00 MOLECULE TYPE: DNA 

(xQ SEQUENCE DESCRIPTION: $EQIDNO:799: 



GGTAACTAGA 
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(2) INFORMATION FOR SBQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY i linear 

(ii) MOLECULE TYPE: UNA 

(Xi) SEQUENCE DESCRIPTION i SEQ ID NO: 600: 

TSGAAGGGCT AATTCACTCA CGGAAAAGAC CAGTTGAACC AGAAGAAGAT AOAAGAOGCC 60 

ATGAAGAAGA AAACAACAGA TT6TTCTGCT TGCTCAGCTG GGQACTTTCC AOAAGGCGCG 120 

GCCTGAGTGA CTAAGCOCCG TTGGGGACTT TCC6AAGAGG CATGAA6GGA CTTTCCAAGG 180 

CAGGCGTGGC CTGGGCGGGA CTG3GGAGTG GCGAGCCCTC AGATGCTGCA TATAAGCAGC 240 

TGCTTTCTGC CTGTACTGGQ TCTCTCTGGT TAGACCAGAT CTGAGCCTGG GAGCTCTCTG 300 

GCTA6CTAGG GAACCCACTG CTTAAGCCTC AATAAAGCTT GCCTTGAGTG CTTCAAGTAG 360 

TGTGTGCCCG TCTGTTGTGT GACTCTGGTA TCTAGAGATC CCTCAGACCA TTTTAGTCCG 420 

TGTGGAAAAT CTCTAGCAGT GGCGCCCGAA CAGGGACTTG AAAGCGAAAG GAAAACCAGA 480 

GGAGCTCTCT CGACGCAGGA CTCGGCTTGC TGAAGCGCGC ACGGCAAGAG GCGAGGGGCG 540 

GCGACTGGTQ AGTACGCCGA AAATTTTGAC TAGCGGAGGC TAGAAGGAGA GAGATGGGTG 600 

CGAGAGCGTC AATATTAAGC GGGGGAAAAT TAGATAOATG GGAGAAAATT CGGTTAAGGC 660 

CAGGAGGAAA GAAAAAGTAT AAATTAAAAC ATATAGTATG GGCAAGCAGG GAGCTAGAAC 720 

GATTCGCAGT CAATCCTGGC CTGTTGGAAA CATCAGAAGG CTGTAGACAA ATACTGGGAC 780 

AGTTACACCC GTCCCTTCAG ACAGGATCAG AAGAACTTAA ATCAGTATAT AATGCAGTAG 840 

CAGTCCTCTA TTGTGTGCAT CAAAACATAG ACATAAAGGA CACCAAGGAA GCTTTAGAAA 900 

AGATAOAGGA AOAGCAAAAC AAATGTAAGA AAAAAGCACA GCAAGCAGCA GCACAGCAAG 960 

CAGCAGCTGG CACAGGAAAC AGCAACCCGG TCAGCCAAAA TTACCCTATA GTACAGAACA 1020 

TGCA3GGGCA AATGGTACAT CAGGCCATAT CACCTAGAAC TTTAAATGCA TGGGTAAAAG 1080 

TAATAGAAGA OAAGGCTTTC AGCCCAGAGG TAATACCCAT GTTTTCAGCA TTATCAGAAG 1140 

GAGCCACCCC ACAAGATTTA AACACCATGC TAAACACAGT GGGGGGACAT CAAGCAGCTA 1200 

TGCAAATGTT AAAAGAGACC ATCAATGAGG AAGCTGCAGA ATGGGATAGA TTACATCCAG 1260 

CGCAGGCAGG GCCTGTTGCA CCAGGCCAGA TGAGAGACCC AAGGGGAAGT GACATAGCAG 1320 

GAACTACTAG TACCCTTCAG GAACAAATAO GATGGATGAC AGGTAATCCA GCTATCCCAG 1380 

TAGGAGAAAT CTATAAAAGA TGGATAATCC TGGGATTAAA TAAAATAGTA AGGATGTATA 1440 

GCCCTATCAG CATTCTGGAC ATAAAACAAG GACCAAAGGA ACCCTTTAGA GACTATGTAG 1500 

ACCGGTTCTA TAAAACTCTA AGAGCCGAGC AAGCTACACA GGAGGTAAAA AATTGGATGA 1560 

CAGAAACCTT GTTGGTCCAA AATGCAAACC CAGATTGTAA GACTATTTTA AAAGCATTGG 1620 
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GACCAGCAGC TACACTAGAA GAAATGATGA CAGCATGTCA GGGAGTGGGA GGACCCAGCC 1680 

ATAAAGCAAG AGTTTTGGCA GAAGCAATGA GCCAAGCAAC AAATGCAGCT ACTGTAATGA 1740 

TGCAGAGAAG CAATTTTAGA AACCAAAGAA AGAATCTTAA GTGTTTCAAT TGTGGCAAAG . 1600 

AAGGGCACAT AGCCAGAAAT TGCAGGGCTC CTAGGAAAAG GGGCTOTTGG AAATGTGGAA i860 

AGGAAGGACA CCAAATGAAA GATTGTACTG AGAGACAGGC TAATTTTTTA GGGAAAATCT 1920 

GGCCTTCCCA GAAGGGGAGG CCAGGGAACT TTCTTCA6AG CAGGCCAGAA CCAACAGCCC 1980 

CTCTCCAGGG CAGGCCGGAG CCATCAGCCC CGCCAGAAOA GAGCTTCA06 TTTGGGGAGG 2040 

AGACAACAAC TCCCTCTCAG AAGCAGGAGC CGATAGACAG GGACAGGGAT CTGTATCCTT 2100 

TRGCTTCCCT CAGATCACTC TTTGGCAACG ACCCCTCGTC ACAATAAAGA TAGGGGGGCA 2160 

GCTGAAGGAA GCTCTATTAG ATACAGGAGC AGATGAIACA GTATTAGAAG ACATGCATTT 2220 

GCCAGGAAAA TGGAAACCAA AAATGATAGG GGGAATTGGA GGTTTTATCA AAGTAAAACA 2280 

ATATGATGAA ATTCTTGTAG AAATCTGTGG ACATAAAGCT ATAGGTACAG TATTAGTAGG 2340 

ACCTAGACCT GTCAACATAA TTGGAAGAAA TCTGTTGACT CAGATTGGTT GCACTTTAAA 2400 

TTTTCCCATT AQTCCTACTG AAACTGTACC AGTACAATTA AAGCCAGGAA TGGATGGCCC 2460 

AAAGGTTAAA CAATGGCCAT TGACAGAAGA GAAAATAAAA GCATTMTAG AAATTTGTAC 2520 

AGAAATGGAA AAGGAAGGAA AGATTTCAAA AATTGGGCCT GAAAATCCAT ACAATACTCC 2580 

AGTATTTGCC ATAAAGAAAA AAGATGGTAC TAAATGGAGA AAATTAGTAG ATTTCAGAGA 2640 

CCTTAATAAG AGAACTGAAG ACTTCTGGGA AGTTCAATTA GGAATACCAC ATCCCTCAGG 2700 

ATTAAAAAAG AAAAAATCAG TAACAGTACT GG&TGTOGGT GATGCATACT TTTCAGTTCC 2760 

CTTAGATGAA AACTTCAGGA AGTATACTGC ATTTACCATA CCTAGTATAA ATAATGAGAC 2820 

ACCAGGGATT AGATATCAGT ACAATGTGCT TCCACAGGGA TGGAAAGGAT CACGAGGAAT 2880 

ATTCCAAAGT AGCATGACAA GAATCTTAGA GCCTTTTAGA AGACAAAATC CAGACATAGT 2940 

TATCTATCAA TACATGGATG ACTTGTATGT AGGATCTGAT TTAGAAATAG GACAGCAXAG 3000 

AATAAAAATA GAGGAACTGA GACAACATCT GTTGAAGTGG GGATTTACCA CACCAGACAA 3060 

AAAGCATCAG AAAGAACCCC CATTCCTTTG GATGGGTTAT GAACTCCATC CTGAXAAATG 3120 

GACAGTGCAA CCTATAGTAC TGCCAGAAAA AGACAGCTGG ACTGTCAATG ACATACAGAA 3180 

GTTAGTGGGT AAATTAAATT GGGCAASTCA GATTTACCCA GGAATTAAAG TAAGGCAATT 3240 

ATGTAAACTC CTTAGGGGAA CCAAAGGACT AACAGAAGTA ATACCACTAA CAGAAGAAGC 3300 

AGAGCTAGAA CTGGCAGAAA ACAGGGAAAT TCTAAGAGAA CCAGTACATG GAGTGTATTA 3360 

TGACCCATCA AAAGACTTAA TAGCASAAAT ACAGAAGCAG GAGCAAGGCC AATGGAjCATA 3420 

TCAAATTTAT CAAGATCAAT TTAAAAATCT AAAAACAGGA AAGTATGCAA GATTGAGGGG 3480 

TGCCCACACT AATGATGTAA AACAATTTCC AGAGGCAGTG CAAAAAATAG CCACAGAAAG 3540 

CATAGXAATA TGGGGAAAGA CTCCTAAATT TAGACTACCC ATACAAAAAG AAACATGGGA 3600 

CGCATGGTGG ACAOAGTATT GGCAAGCCAC CTGGATTCCT GACTGGGAGT TTGTCAATAC 3660 
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CCCTCCCCTA GTAAAATTAT GGTACCAGTT AGAAAAAGAA CCCATAATAG GAGCAGAAAC 3720 

TTTCTATGTA GATGGGGCAG CTAACAGAGA GACTAAATTA GGAAAAGCAG GATATGTTAC 3780 

TGACAGAGGA AGACAAAAAG TTGTCTCCCT AACTGACACA ACAAATCAGA AGACTGAGTT 3840 

ACAAGCAATT CATCTAGCTT TGCAGGATTC AGGATTAGAA GTAAACATAG TAACAGACTC 3900 

ACAGTATGCA TTAGGAATCA TTCAAGCACA ACCAGATAAA AGTGAATCAG AAATAGTCAA 3960 

TCAAATAATA GAGCAATTAA TAAAAAAGGA AAAGGTCTAC CTGGCATGGG TACCAGCACA 4020 

CAAAGGAATT GGAGGGAAIG AACAAGTAGA TAAATTAGTC AGTGCTGGAA TCAGGAAAAT 4080 

ACTATTTTTA GATGGAATAG ATAAGGCACA AGAAGGCCAT GAGAAATATC ACAGTAATTG 4140 

GAGAGCAATG GCTAGTGGTT TTAACCTGCC ACCTATAGTA GGAAAAGAAA TAGTAGCCAG 4200 

CTGTGATAAA TGTCAGCTAA AAGGAGAAGC CATGCATGGA CAAGTAGACT GTAGTCCAGG 4260 

AATATGGCAA CTAGATTGTA CACATCTAGA AGGAAAAATT ATCCTGGTAG CAGTTCATGT 4320 

AGCCAGTGGA TATATAGAAG CAGAAGTTAT TCCAGCAGAG ACAGGGCAGG AAACAGCATA 4380 

CTTTATCTTA AAATTAGCAG GAAGGTGGCC AGTAAACACA ATACATACAG ACAATGGCGG 4440 

CAATTTCATC AGTACCACGG TTAAGGCCGC CTOTTGGTGG GGAGGGAXGA AGCAGGAATT 4500 

TGGCATTCCC TACAATCCCC AAAGCCAAGG AGTAGTGGAA TCTATGAATA GAGAATTAAA 4560 

GAAAATTATA GGACAGGTAA GAGATCAGGC TGAACATCTT AAGACAGCAG TACAAATGGC 4620 

AGTATTCATC CACAATTTTA AAAGAAAAGG GGGGATTGGG GGATACAGTG CAGGGGAAAG 4680 

AATAGTAGAC ATAATAGCAA CAGACATACA AACTAAAGAA TTACAAAAGC AAATTACAAA 4740 

AATTCAAAAT TTTCGGGTTT ATTACAGGGA CAGCAGAGAT CCACTTTGGA AAGGACCAGC 4800 

AAAACTTCTC TGGAAAGGCG AAGGGGCAGT AGTAATACAA GATAATAGTG ACATAAAAGT 4860 

AGTGCCAAGA AGAAAAGTAA AGATCATTAG GGATTATGGA AAACAGATGG CAGGTGATGA 4920 

TT GTG T GQ CA AGTAOACAGG ATGAGGATTA GAACATGOAA CAGTTTAGTG AAACAGCATA 4980 

TGTATGTTTC AAAGAAAGCT AAGGGATGGA TTTAXAGACA TCACTATGAA AACACTCATC 5040 

CAAAAATAAG CTCAGAAGTA CACATCCCAC TAGGGGAAGC TAGATTGGTA ATAACAACAT 5100 

ATTGGGGTCT ACATACAGGA GAAAGAGACT GGCATTTGGO TCAGGGAGTC TCCATAGAAT 5160 

GGAGG6AAAG GACAIATAGA ACACAAGTAG ACCCCGAACT AGCAGACCAA CTAATTCATA 5220 

TACATTACTT TGATTGTTTT TCAGAATCTG CCATAAGAAG TGCCATATTA GGATATAGAG 5280 

TTAGGGATAG GTGT8AATAT CAAGCAGGAC ATAACAAGGT AGOATCTCTA GAGTACTTGG 5340 

CACTAACAGC ATTAATAACA CCAAAGAAGA TAAAGCCACC TTTGCCTAGT GTTGCGAAAC 5400 

TGACAGAGGA TAGATGGAAC AAGCCCCAGA AGACCAAGGG CCACAGAGGC AGCCATACAA 5460 

TOAATGGACA CTAGAACTTT TAGAGGAGCT TAAGAATGAA GCTGTTAGGC ATTTTCCTAG 5520 

GGTATGGCTC CATGGCTTAG GGCAACATAT CTATGAAACT TATGGGGATA CTTGGGAAGG 5580 

AGTGGAGGCC ATAACAAGAA CTCTGCAACA ACTGCTGTTT ATTCATTTCA GAATTGGGTG 5640 

TCAACAXAGC AGAATAGGCA TTATTCGACA GAGGAGAGCA AGAAATGGAG CCAGTAGATC 5700 
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CTAGACTAGA GCCCTGGAAG CATCCAGGAA GTCAGCCTAA GACTGCGTGT ACCACTTGCT 5760 

ATTGTAAAAA GTGCTGCTTT CATTGCCAAG TTTGTTTTAT GACAAAAGGC TTAGGCATCT 5820 

CCTATGGCAQ GAAGAAGOGG AGACAGCGAC GAAGAGCTCC TCAAGACAGT CAGACTCATC 5880, 

AAGCTTATCT ATCAAAGCAG TAAGTAATAT ATGTAATGCA ACCTTTACAA ATAGTAGCAA 5940 

TAGTAGCATT AGTAGTAGCA GGAATAATAG CAATAGTTGT GTGGACCATA GTATTCATAG 6000 

AATATAAGAA AAIAXTAAGA CAAAGAAAAA TAGACAGGTT GATTGATAGA ATAAGAGAAA 6060 

GAGCAGAAGA CAGTGGCAAT GACAGTGAAG GGGATCAGGA AGAATTATCG GCACTTGTGG 6120 

ACATGGGGCA CCATOATCCT TGGGATATTA ATGATCTGTA GAGCTGCAAA CAATTTGTGG 6180 

GTCACAGTCT ATTATGGGGT ACCTGTGTGG AGAGAAGCAA OCACCACTCT ATTTTGTGCA 6240 

TCAGATGCCA AGGCAIATGA TGGAGAGGTA CATAATGTTT GGGCCACACA TGCCTGTCTA 6300 

CCCACAGACC CTAACCCACA AGAAGTAGAA TTGAAAAATG TGACAGAAAA TTTTAACATG 6360 

TGGAAAAATA ACATGGTAOA ACAGATGCAT GAGGATATAA TCAGTTTATG GGATCAAAGC 6420 

CT6AAGCCAT GTGTAAAATT AACCCCACTC TGTGTTTCTT TAAATTGCAC TGATGCTACT 6480 

AAXACCAC1A ATAGTAATAC CACTAGGAGC AGCGAGAAAC CGAAGGGGAC AGGGGAAATA 6540 

AAAAACTGCT CTTTCAATAT CACCACAAGC ATAAGAGATA AGGTGCAGAA ACAAXATGCA 6600 

CTTTTTTATA GCCTTGATGT AGTACCAATG GATGATAATG ATAATAGTAC AAGCTATAGG 6660 

TTAATAAGTT GTAACACCTC AATCATTACA CAGGCCTGTC CAAAGATATC CTTTGAGCCA 6720 

ATTCCCATAC ATTATTGTGC CCCGGCTGGT TTTGCGATTC TAAAGTGTAA AGATAAAAGG 6780 

TTCAATGGAA AAGGACCATG TACAAGTGTC AGCACAG2AC AGTGTACACA TGGAATTAGG 6840 

CGAGTAGTAT CAACTCAACT GTTGTTAAAT GGCAGTCTAG CAGAAGAAGA GGTAGTAATT 6900 

AGATCTGACA ATTTTACGAA CAATGCTAAA ACGATAATAG TACAGCTGAG CAAATCTGTA 6960 

GAAATTACTT GTGTAAGACC CAACAACAAT ACAAGAAAAA GTATAAGTAT GGGACCAGGG 7020 

AGAGCATTTT AXACAACAGA AATAATAGGA GATATAAGAC AAGCATATTG TAACATTAGT 7080 

AAAGCAAACT GGACTGACAC TTTAGAACAG ATAGCTAGAA AATTAAGAGA ACAATTTGAG 7140 

AATAAAACAA TAGTCTTTAA GCCATCCTCA GGAGGGGACC CAGAAATTGT AACACAGTTT 7200 

TACAGTrTTA ATTGTGGAGG GGAATTTTTC TACTGTAATT CAACACAACT GTTTAATGGT 7260 

ACTT9GAATG GTACTTGGGT TAATGGTACT TGGAGTAGTA ATAATACGAC TGAXACTGGA 7320 

AATATCACAC TCCCATGCAG AATAAAACAA TTTATAAACA TGTGGCAGGA AGTAGGAAAA 7380 

GCAATGTATG COCCTCOCAT CAAAGGACAA ATTAAATGTA CATCAAATAT TACAGGGCTG 7440 

ATATTAACAA GAGATGGTGG TAACAAIAAC ACGACGAACG ACAACGAGAC CGAGACCTTC 7500 

AGACCTGGAG GAGGAGATAT GAGGGACAAT TGGAGAAGTG AATTATATAA ATATAAAGTA 7560 

GTACAAGTTG AACCATTAGG AGTAGCACCC ACCAAGGCAA AGAGAAGAGT GGTGCAAAGA 7620 

GAAAAAAGAG CAGTGGGAAT AGGAGCTATG TTCCTTGGGT TCTTAGGAGC AGCAGGAAGC 7680 

ACTATGGGCG CAGCGTCAGT GACGCTGAOG GTACAAGCCA GACAA7TATT GTCTGGTATA 7740 
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GTGCAGCAOC AGAACAATCT GCTGAGGGCT ATTGAGGCGC AACAGCATCT GTTGCAACTC 7800 

ACAGTCTGGG GCATCAAACA GCTCCAGGCA AGAGTCCTGG CTGTGGAAAG ATACCTAAGG 7060 

GATCAACAQC TCCTGGGACT TTGGGOTTGC TCTGGAAAAC TCATTTGCAC CACTACTGTG 7920 

CCTTGGAACA ATAGCTGGAQ TAATAAATCT CTGGAAACAA TTTGGGATAA CATGACCTGG 7980 

ATGCAGTGQG AAAGAOAAAT TGACAATTAC ACAAACATAA TATACACCTT AATTGAAGAA 8040 

TC6CAGAACC AACAAOAAAA AAATGAACTA GAATTATTGG AATTGGATAA ATGGGCAAAT 8X00 

TTGTGGAATT GQTTTAGTAT ATCAAACTGG CTATGGTATA TAAAATTATT CATAATGGTA 8160 

GTAGGAGGCT TGGTAGGTTT AAGAATAGTT TTTACTGTAC TTTCTATAGT TAATAGAGTT 8220 

AGGCAGGGAT ACTCACCATT ATCGTTTCAG ACCCACCTCC CAACCCCGAA GGGACCCGAC 8280 

AGGCCAGAAG GAATCGAAGA AGAAGGTGGA GAGAGAGACA GAGGCAGCTC CACTCGATTA 8340 

GTGCACGGAT TCTTAGCACT TTTCTGGGAC GACCTOAGGA GTCTGTGCCT CTTGAGCTAC 8400 

CACCACTTGA GAGACTTACT CTTGATTGTA ACGAGGATTG TGGAACTTCT GGGACGCAGG 8460 

GGATGGGAAG CCCTCAAATA CTGGTGGAAT CTCCTGCAGT ATTGGAGGCA GGAACTACAG 8520 

AAGAGTGCTG TTAGCTTGTT CAATGGCACG GCGATAGCAG TAGCTGAGGG GACAGATAGA 8S8Q 

GTTATAGAAG CTTTAOGAAG GGCTTATAGA GCTATTCTCC ACATACCTAG AAGAATAAGA 8640 

CAGGGCTTAG AAAGGGCTTT GCTATAAAAT GGGTGGCAAO TGGTCAGAAA OTAGTGTGGT 8700 

TAGAAGGCAT GTACCTTTAA GACAAGGCAG CTAXAGATCT TAGCCGCTTT TTAAAAGAAA 8760 

AGGGGGGACT GGAAGGGCTA ATTCACTCAC GGAAAAGACC AGTTGAACCA GAAGAAGATA 8820 

GAAGAGGCCA TGAAGAAGAA AACAACAGAT TGTTCTGCTT GCTCAGCTGG GGACTTTCCA 8880 

GAAGGCGCGG CCTGAGTGAC TAAGCCCCGT TGGGGACTTT CCGAAGAGGC ATGAAGGGAC 8940 

TTTCCAAGGC AGGCGTOGCC TGGGGGGGAC TGGGGAGTGG CGAGCCCTCA GATGCTGCAT 9000 

ATAAGCAGCT GCTTTCTGCC TGTACTGGGT CTCTCTGGTT AGACCAGATC TGAGCCTGGG 9060 

AGCTCTCTGG CTAGCTAGGG AACCCACTGC TTAAGCCTCA ATAAAGCTTG CCTTGAGTGC 9120 

TTCAAOTAGT GTGTGCCCGT CTGTTGTQTG ACTCTGGTAT CTAGAGATCC CTCAGACCAT 9180 

TTTAGTCCGT GTGGAAAATC TCTAGCA 9207 
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CLAIMS: 

1. An isolated HIV-1 strain or a component, part; fragment or derivative thereof 
which is substantially non-pathogenic. 

2. A strain of MV-1 according to claim 1 wherein said strain is capable of 
stimulating an immune response in a human or primate against at 1 east one glycoprotein 
on HIV-1 whilst not substantially reducing proliferative responses and cytokine 
production to a mitogen in said human or primate compared to a healthy, non-infected 
human or primate subject 

3 . A strain of HTV- 1 according to claim 2 which carries one or more mutations in 
its genome resulting in the inability to direct synthesis of at least one polypeptide or 
protein from a pathogenic strain of HIV-1 or results in reduced synthesis of said 
polypeptide or protein. 

4. A strain of HIV-1 according to claim 2 which canies one or more mutations in 
its genome resulting in the said genome directing synthesis of a truncated form of a 
polypeptide or protein from a pathogenic strain of HTV-L 

5. A strain of HIV-1 according to claim 3 or 4 wherein said strain carries a 
mutation in its iw/gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HIV-1 genome. 

6. A strain of HTV-1 according to claim 5 wherein said strain is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the ne/gene and/or LTR region 
of pathogenic HIV-1. 

7. A strain of HTV-1 according to claim 6 wherein said strain is capable of inducing 
an immune response to at least one of gag, pel and/or em. 
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8. A strain of HTV-1 according to claim 7 herein said strain carries a deletion in 
its genome of at least 10 nucleotides within die region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HTV-1 strain NL4-3. . s 

9. A strain of HTV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (i) 8830-8862; 

(ii) 9009-9035; 
(ui) 9019-9029; and 

(iv) 9033-9049. 



10. A strain of HTV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting o£ 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 
(viu) 9271-9370, 



11. A strain of HIV- 1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 

nucleotide (ix) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 
(xu) 9112-9204. 



12. A strain of HTV-1 according to claim 8 wherein said strain carries a deletion in 
its genome of at least 10 nucleotides from within a region selected from the list 
consisting of: 
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nucleotide (xiii) 9105-9224 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

13. A strain of HTV-I according to claim 8 wherein said deletion encompasses at 
least one of the following decanucleoddes from the nef gene of HTV-1 strain NL4-3: 



ATGGGTGGCA 


(SEQ ID MO: 2) ; 


TGGGTGGCAA 


(SEQ ID NO 


:3); 


GGGTGGCAAG 


(SEQ ID NO:4) ; 


GGTGGCAAGT 


(SEQ ID NO: 5); 


GTGGCAAGTG 


(SEQ ID NO: 6) ; 


TGGCAAGTGG 


(SEQ 


ID 


NO:7) 


GGCAAGTGGT 


(SEQ ID NO: 8) ; 


GCAAGTGGTC 


(SEQ ID NO: 9) ; 


CAAGTGGTCA 


(SEQ ID NO: 10) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO: 11) 


AGTGGTCAAA 


(SEQ ID NO: 12) ; 


GTGGTCAAAA 


(SEQ 


ID 


NO: 13) 


TGGTCAAAAA 


(SEQ ID NO: 14) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO: 15) 


GTCAAAAAGT 


(SEQ ID NO: 16) : 


TCAAAAAGTA 


(SEQ 


ID 


NO:17) 


CAAAAAGTAG 


(SEQ ID HO: 18) ; 


AAAAAGTAGT 


(SEQ ID NO: 19) ; 


AAAAGTAGTG 


(SEQ ID NO:20) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO: 21) 


AAGTAGTGTG 


(SEQ ID NO: 22} ; 


AGTAGTGTGA 


(SEQ 


ID 


NO: 23) 


GTAGTGTGAT 


(SEQ ID NO: 24) ; 


TAGTGTGATT 


(SEQ ID NO 


:25) ; 


AGTGTGATTG 


(SEQ ID NO: 26) ; 


GTGTGATTGG 


(SEQ 


ID 


NO: 27) 


TGTGATTGGA 


(SEQ ID NO: 28) ; 


GTGATTGGAT 


(SEQ 


ID 


NO:29) 


TGATTGGATG 


(SEQ ID NO:30) ; 


GATTGGATGG 


(SEQ 


ID 


NO: 31) 


ATTGGATGGC 


(SEQ ID NO: 32) ; 


TTGGATGGCC 


(SEQ 


ID 


NO: 33) 


TGGATGGCCT 


(SEQ ID NO: 34) ; 


GGATGGCCTG 


(SEQ 


ID 


NO: 35) 


GATGGCCTGC 


(SEQ ID NO: 36) ; 


ATG6CCTGCT 


(SEQ 


ID 


NO: 37) 


TGGCCTGCTG 


(SEQ ID NO: 38) ; 


GGCCTGCTGT 


(SEQ 


ID 


NO:39) 


GCCTGCTGTA 


(SEQ ID NO: 40) ; 


CCTGCTGTAA 


(SEQ 


ID 


NO:41) 


CTGCTGTAAG 


(SEQ ID NO: 42) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO:43) 


GCTGTAAGGG 


(SEQ ID NO: 44); 


CTGTAAGGGA 


(SEQ 


ID 


NO: 45) 


TGTAAGGGAA 


(SEQ ID NO:46); 


GTAAGGQAAA 


(SEQ 


ID 


NO:47) 


TAAGGGAAAG 


(SEQ ID NO: 48); 


AAGGGAAAGA 


(SEQ 


ID 


NO:49) 


AGGGAAAGAA 


(SEQ ID NO:50) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO: 51) 


GGAAAGAATG 


(SEQ ID NO:52) ; 


GAAAGAATGA 


(SEQ 


ID 


NO: 53) 


AAAGAATGAG 


(SEQ ID NO: 54) ; 


AAGAATGAGA 


(SEQ 


ID 


NO: 55) 


AGAATOAGAC 


(SEQ ID NO: 56); 


GAATGAGACG 


(SEQ 


ID 


NO: 57) 


AATGAGACGA 


(SEQ ID NO:58); 


ATGAGACGAG 


(SEQ 


ID 


NO: 59) 


TGAGACGAGC 


(SEQ ID NO: 60) ; 


GAGACGAGCT 


(SEQ 


ID 


NO: 61) 
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AGACGAGCTG 


(SEQ 


ID 


NO: 62) 




ACGAGCTGAG 


(SEQ 


ID 


NO: 64) , 




GAGCT6A6CC 


(SEQ 


ID 


NO: 66) 




GCTGAGCCAG 


(SEQ 


ID 


NO:68) , 




TGAGCCAGCA 


(SEQ 


ID 


NO:70) 




AGCCAGCAGC 


(SEQ 


ID 


NO: 72) 




CCAGCAGCAG 


(SEQ 


ID 


NO: 74) 




AGCAGCAGAT 


(SEQ 


ID 


NO:76) 




CAGCAGATGG 


(SEQ 


ID 


NO: 78) 




GCAGATGGGG 


(SEQ 


ID 


NO:80) , 




AGATGGGGTG 


(SEQ 


ID 


NO: 82) , 




ATGGGGTGG6 


(SEQ 


ID 


NO: 84) , 




GGGGT6GGA6 


(SEQ 


ID 


NO: 86) ; 




GGTGGGAOCA 


(SEQ 


ID 


NO:88) ; 




TGGGAGCAGT 


(SEQ 


ID 


NO: 90) ; 


.GGAGCAGTAT 


(SEQ 


ID 


NO: 92) ; 




AGCAGTATCT 


(SEQ 


ID 


NO: 94) j 




CAGTATCTCG 


(SEQ 


ID 


NO: 96) ; 


GTATCTCGAG 


(SEQ 


ID 


NO: 98) ; 


ATCTCGAGAC 


(SEQ 


ID 


NO: 100) ; 


CTCGAGACCT 


(SEQ 


ID 


NO: 102) '; 


CGAGACCTAG 


(SEQ 


ID 


NO: 104) ; 


AQACCTAOAA 


(SEQ 


ID 


NO: 106) ; 


ACCTAGAAAA 


(SEQ 


ID 


NO: 108) ; 


CTAGAAAAAC 


(SEQ 


ID 


NO: 110) 


• 


AGAAAAACAT 


(SEQ 


ID 


NO: 112) ; 


AAAAACATGG 


(SEQ 


ID 


NO: 114) ; 


AAACATGGAG 


(SEQ 


ID 


NO: 116) ; 


ACATGGAGCA 


(SEQ 


ID 


NO: 118) ; 


ATGGAGCAAT 


(SEQ 


ID 


NO: 120) ; 


GGAGCAATCA 


(SEQ 


ID 


NO: 122) ; 






ID 


NO: 124) ; 


CAATCACAAG 


(SEQ 


ID 


NO: 126) ; 


ATCACAAGTA 


(SEQ 


ID 


NO: 128) ; 


CACAAGTAGC 


(SEQ 


ID 


NO: 130) ; 


CAAGTAGCAA 


(SEQ 


ID 


NO: 132) ; 


AGTAGCAATA 


(SEQ 


ID 


NO: 134) ; 


TAGCAATACA 


(SEQ 


ID 


NO: 136); 
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GACGAGCTGA 


(SEQ 


ID 


NO: 63) ; 


CGAGCTGAGC 


(SEQ 


ID 


NO: 65) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO: 67) ; 


CT3AGCCAGC 


(SEQ 


ID 


NO: 69) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO:71) ; 


GCCAGCAGCA 


(SEQ 


ID 


NO: 73) ; 


CAGCAGCAGA 


(SEQ 


ID 


NO: 75) ; 


GCAGCAGATG 


(SEQ 


ID 


NO: 77) ; 


AGCAGATGGG 


(SEQ 


ID 


NO: 79) ; 


CAGATGGGGT 


(SEQ 


ID 


NO: 81) ; 


GATGGGGTGG 


(SEQ 


ID 


NO: 83) ; 


TGGGGTGGGA 


(SEQ 


ID 


NO:85) ; 


GGGTGGGAGC 


(SEQ 


ID 


NO: 87) ; 


GTGGGAGCAG 


(SEQ 


ID 


NO: 89) ; 


GGGAGCAOTA 


(SEQ 


ID 


NO: 91) ; 


GAGCAGTATC 


(SEQ 


ID 


NO:93) ; 


GCAGTATCTC 


(SEQ 


ID 


NO: 95) ; 


AGTATCTOGA 


(SEQ 


ID 


NO: 97) ; 


TATCTCGAGA 


(SEQ 


ID 


NO:99); 


TCTCOAGACC 


(SEQ 


ID 


NO: 101) ; 



TCGAGACCTA (SEQ ID NO: 103) ; 
GAGACCTAGA (SEQ ID NO: 105) ; 
GACCTAGAAA (SEQ ID NO: 107); 
CCTAGAAAAA (SEQ ID NO: 109) ; 
TAGAAAAACA (SEQ ID NO: 111) ; 
GAAAAACATG (SEQ ID NO : 113); 
AAAACATGGA (SEQ ID NO: 115) ; 
AACATGGAGC (SEQ ID NO: 117) ; 
CATGGAGCAA (SEQ ID NO: 119) ; 
TGGAGCAATC (SEQ ID NO: 121) ; 
GAGCAATCAC (SEQ ID NO: 123) ; 
GCAATCACAA (SEQ ID NO: 125); 
AATCACAAGT (SEQ ID NO: 127) ; 
TCACAAGTAG (SEQ ID NO: 129) ; 
ACAAGTAGCA (SEQ ID NO: 131) ; 
AAGTAGCAAT (SEQ ID NO: 133) ; 
GTAGCAATAC (SEQ ID NO: 135) ; 
AGCAATACAG (SEQ ID NO:137) ; 
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GCAATACAGC (SEQ ID NO: 13 8) ; 

AATACAGCAQ (SEQ ID NO: 140); 

TACAGCAGCT (SEQ ID NO: 142) ; 

CAGCAGCTAA (SEQ ID NO: 144); 

GCAGCTAACA (SEQ ID NO:146); 

AGCTAACAAT (SEQ ID N0:148); 

CTAACAATGC (SEQ ID NO: 150); 

AACAATGCTG (SEQ ID NO: 152); 

CAATGCTGCT (SEQ ID NO: 154) ; 

ATGCTGCTTG (SEQ ID NO: 156); 

GCTGCTTGTG (SEQ ID NO: 158); 

TGCTTGTGCC (SEQ ID NO: 160) ; 

CTTGTGCCTQ (SEQ ID NO: 162); 

TGTGCCTGGC (SEQ ID NO: 164); 

TGCCTGGCTA (SEQ ID NO: 166) ; 

CCTGGCTAGA (SEQ ID NO: 168) ; 

TGGCTAGAAG (SEQ ID NO: 170); 
GCTAGAAGCA (SEQ ID N0:172); 
TAGAAGCACA (SEQ ID NO: 174) ; 
GAAGCACAAG (SEQ ID NO: 176) ; 
AGCACAAGAG (SEQ ID NO: 178); 
CACAAGAGGA (SEQ ID NO: 180); 
CAAGAGGAGG (SEQ ID NO: 182); 
AGAGGAGGAA (SEQ ID NO: 184); 
AGGAGGAAGA (SEQ ID NOtl86) ; 
GAGGAAGAGG (SEQ ID NO: 188); 
GGAAGAGGTG (SEQ ID NO: 190); 
AAGAGGTGGG (SEQ ID NO:192); 
GAGGTGGGTT (SEQ ID NO: 194) ; 
GGTGGGTTTT (SEQ ID NO: 196) ; 
TGGGTTTTCC (SEQ ID NO: 198); 
GGTTTTCCAG (SEQ ID NO: 200) ; 
TTTTCCAGTC (SEQ ID NO: 202) ; 
TTCCAGTCAC (SEQ ID NO: 204) ; 
CCAGTCACAC (SEQ ID NO: 206) ; 
AGTCACACCT (SEQ ID NO: 2 08); 
TCACACCTCA (SEQ ID NO:210); 
ACACCTCAGG (SEQ ID NO: 212); 
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CAATACAGCA 


(SEQ 


ID 


NO:139) 


ATACAGCAGC 


(SEQ 


ID 


NO: 141) 


ACAGCAGCTA 


(SEQ 


10 


NO: 143) , 


AGCAGCTAAC 


(SEQ 


ID 


NO: 145) , 


CAGCTAACAA 


(SEQ 


ID 


NO: 147) , 


GCTAACAATG 


(SEQ 


ID 


NO:149) , 


TAACAATGCT 


(SEQ 


ID 


NO:151) j 


ACAATGCTGC 


(SEQ 


ID 


NO: 153) ; 


AATGCTGCTT 


(SEQ 


ID 


NO:155) ; 


TGCTGCTTGT 


(SEQ 


ID 


NO:157) ; 


CTGCTTGTGC 


(SEQ 


ID 


NO: 159) ; 


GCTTGTGCCT 


(SEQ 


ID 


NO: 161) ; 


TTGTGCCTGG 


(SEQ 


ID 


NO: 163) ; 


GTGCCTGGCT 


(SEQ 


ID 


NO: 165) ; 


GCCTGGCTAG 


(SEQ 


ID 


NO: 167) ; 


CTGGCTAGAA 


(SEQ 


ID 


NO: 169) ; 


GGCTAGAAGC 


(SEQ 


ID 


NO: 171) ; 


CTAGAAGCAC 


(SEQ 


ID 


NO:173) } 


AGAAGCACAA 


(SEQ 


ID 


NO:17S); 


AAGGACAAGA 


(SEQ 


ID 


NO: 177) ; 


GCACAAGAGG 


(SEQ 


ID 


NO s 179) ; 


ACAAGAGGAG 


(SEQ 


ID 


NO: 181) ; 


AAGAGGAGGA 


(SEQ 


ID 


NO: 183) ; 


GAGGAGGAAG 


(SEQ 


ID 


NO: 185) ; 


GGAGGAAGAG 


(SEQ 


ID 


NO:187).; 


AGGAAGAGGT 


(SEQ 


ID 


NO: 189} ; 


GAAGAGGTGG 


(SEQ 


ID 


NO: 191) ; 


AGAGGTGGGT 


(SEQ 


ID 


NO: 193) ; 


AGGTGGGTTT 


(SEQ 


ID 


NO: 195) ; 


GTGGGTTTTC 


(SEQ 


ID 


NO: 197) ; 


GGGTTTTCCA 


(SEQ 


ID 


NO: 199) ; 


GTTTTCCAGT 


(SEQ 


ID 


NO: 201) ; 


TTTCCAGTCA 


(SEQ 


ID 


NO: 2 03) ; 


TCCAGTCACA 


(SEQ 


ID 


N0:205) ; 


CAGTCACACC 


(SEQ 


ID 


NO:207) ; 


GTCACACCTC 


(SEQ 


ID 


NO:209) ; 


CACACCTCAG 


(SEQ 


ID 


NO:211) ; 


CACCTCAGGT 


(SEQ 


ID 


NO: 213) ; 
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ACCTCAGGTA (SEQ ID NO: 2 14); 
CTCAGGTACC (SEQ ID NO: 216} ; 
CAGGTACCTT (SEQ ID NO: 218); 
GGTACCTTTA (SEQ ID NO: 220}; 
TACCTTTAAG (SEQ ID NO: 222) ; 
CCTTTAAGAC (SEQ ID NO: 224) ; 
TTTAAGACCA (SEQ ID NO: 226); 
TAAGACCAAT (SEQ ID NO: 22 8) ; 
AGACCAATGA (SEQ ID NO: 230) ; 
ACCAATGACT (SEQ ID NO : 232); 
CAATGACTTA (SEQ ID NO: 234) ; 
ATGACTTACA (SEQ ID NO: 236); 
GACTTACAAG (SEQ ID NO: 238); 
CTTACAAGGC (SEQ ID NO: 240) ; 
TACAAGGCAG (SEQ ID NO: 242); 
CAAGGCAGCT (SEQ ID NO: 244) ; 
AGGCAGCTGT (SEQ ID NO: 246); 
6CAGCTGTA6 (SEQ ID NO: 248); 
AGCTGTAGAT (SEQ ID NO: 250); 
CTGTAGATCT (SEQ ID NO : 252); 
GTAGATCTTA (SEQ ID NO: 254); 
AGATCTTAGC (SEQ ID NO: 25 6) ; 
ATCTTAGCCA (SEQ ID NO: 258) ; 
CTTAGCCACT (SEQ ID NO: 260); 
TAGCCACTTT (SEQ ID NO: 262); 
GCCACTTTTT (SEQ ID NO:264); 
CACTTTTTAA (SEQ ID NO: 266); 
CTTTTTAAAA (SEQ ID NO: 268); 
TTTTAAAAGA (SEQ ID NO: 270) ; 
TTAAAAGAAA (SEQ ID NO: 272) ; 
AAAAGAAAAG (SEQ ID NO: 274); 
AAGAAAAGGG (SEQ ID NO: 276); 
GAAAAGGGGG (SEQ ID NO: 278); 
AAAGGGGGGA (SEQ ID NO:280) ; 
AGGGGGGACT (SEQ ID NO: 282) ; 
GGGGGACTGG (SEQ ID NO: 284) ; 
GGGACTGGAA (SEQ ID NO: 286) ; 
GACTGGAAGG (SEQ ID NO: 288) ; 
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CCTCAGGTAC 


(SEQ 


ID 


NO: 215) ; 


TCAGGTACCT 


(SEQ 


ID 


NO: 217} ; 


AGGTACCTTT 


(SEQ 


ID 


NO: 219) ; 


GTACCTTTAA 


(SEQ 


ID 


NO: 221) ; 


ACCTTTAAGA 


(SEQ 


ID 


NO -.223) ; 


CTTTAAGACC 


(SEQ 


ID 


NO: 225) ; 


TTAAGACCAA 


(SEQ 


ID 


NO: 227) ; 


AAGACCAATG 


(SEQ 


ID 


NO: 229) ; 


GACCAATGAC 


(SEQ 


ID 


NO: 231) ; 


CCAATGACTT 


(SEQ 


ID 


NO:233) ; 


AATGACTTAC 


(SEQ 


ID 


NO:235) ; 


TGACTTACAA 


(SEQ 


ID 


NO: 237) ; 


ACTTACAAGG 


(SEQ 


ID 


NO: 239) ; 


TTACAAGGCA 


(SEQ 


ID 


NO:241); 


ACAAGGCAGC 


(SEQ 


ID 


NO:243) ; 


AAGGCAGCTG 


(SEQ 


ID 


NO:245) ; 


GGCAGCTGTA 


(SEQ 


ID 


NO:247) ; 


CAGCTGTAGA 


(SEQ 


ID 


NO: 249) ; 


GCTGTAGATC 


(SEQ 


ID 


NO: 251) ; 


TGTAGATCTT 


(SEQ 


ID 


NO: 253) ; 


TAGATCTTAG 


(SEQ 


ID 


NO: 255} ; 


GATCTTAGCC 


(SEQ 


ID 


NO: 257) ; 


TCTTAGCCAC 


(SEQ 


ID 


NO: 259) ; 


TTAGCCACTT 


(SEQ 


ID 


NO: 261) ; 


AGCCACTTTT 


(SEQ 


ID 


NO: 263) ; 


CCACTTTTTA 


(SEQ 


ID 


NO:265) ; 


ACT1TITAAA 


(SEQ 


ID 


NO: 267) ; 


TTTTTAAAAG 


(SEQ 


ID 


NO: 269) ; 


TTTAAAAGAA 


(SEQ 


ID 


NO: 271) ; 


TAAAAGAAAA 


(SEQ 


ID 


NO: 273) ; 


AAAGAAAAGG 


(SEQ 


ID 


NO: 275) ; 


AGAAAAGGGG 


(SEQ 


ID 


NO:277} ; 


AAAAGGGGGG 


(SEQ 


ID 


NO: 279) ; 


AAGGGGGGAC 


(SEQ 


ID 


NO: 281) ; 


GGGGGGACTG 


(SEQ 


ID 


NO: 283) ; 


GGGGACTGGA 


(SEQ 


ID 


NO: 285) ; 


GGACTGGAAG 


(SEQ 


ID 


NO: 287) ; 


ACTGGAAGGG 


(SEQ 


ID 


NO:289) ; 
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CTGGAAGGGC (SEQ ID NO: 290); 
GGAAGGGCTA (SEQ ID NO: 292) ; 

AAGGGCTAAT (SEQ ID NO: 294) ; 

GGGCTAATTC (SEQ ID NO:296) ; 

GCTAATTCAC (SEQ ID NO:298); 

TAATTCACTC (SBQ ID NO: 3 00) ; 

ATTCACTCCC (SEQ ID NO: 302); 

TCACTCCCAA (SEQ ID NO: 304) ; 

ACTCCCAAAG (SEQ ID NO : 306); 

TCCCAAAGAA (SEQ ID NO:308); 

CCAAAGAAGA (SEQ ID NO:310); 

AAAOAAGACA (SEQ ID NO:312) ; 

AGAAGACAAG (SEQ ID NO: 3X4); 

AAGACAAGAT (SEQ ID NO: 316) ; 

GACAAGATAT (SEQ ID NO: 318) ; 

CAAGATATCC (SEQ ID NO: 320) ; 

AGATATCCTT (SEQ ID NO: 322); 

ATATCCTTGA (SEQ ID NO: 324) ; 

ATCCTTGATC (SEQ ID NO: 326) ; 

CCTTGATCTG (SEQ ID NO: 328) ; 

TTGATCTGTG (SEQ ID NO:330); 

GATCTGTGGA (SEQ ID NO: 332) ; 

TCTGTGGATC (SEQ ID NO: 334) ; 

TGTGGATCTA (SEQ ID NO: 336) ; 

TGGATCTACC (SEQ ID NO:338) ; 
GATCTACCAC (SEQ ID NO: 340); 
TCTACCACAC (SBQ ID NO: 342) ; 
TACCACACAC • (SEQ ID NO: 344) ; 
CCACACACAA (SEQ ID NO: 346) ; 
ACACACAAGG (SEQ ID NO: 34 8) ; 
ACACAAGGCT (SEQ ID NO:350) ; 
ACAAGGCTAC (SEQ ID NO: 352) ; 
AAGGCTACTT (SEQ ID NO: 354) ; 
GGCTACTTCC (SEQ ID NO: 356) ; 
CTACTTCCCT (SEQ ID NO:358) ; 
ACTTCCCTGA (SEQ ID NO: 360) ; 
TTCCCTGATT (SEQ ID NO: 362) ; 
CCCT6ATTG6 (SEQ ID NO: 3 64) ; 
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TGGAAGGGCT (SEQ ID NO: 291) ; 

GAAGGGCTAA (SEQ ID NO:293); 

AGGGCTAATT (SEQ ID NO:295) ; 

GGCTAATTCA (SEQ ID NO: 297) ; 

CTAATTCACT (SEQ ID NO:299); 

AATTCACTCC (SEQ ID NO: 3 01) ; 

TTCACTCCCA (SEQ ID NO:303); 

CACTCCCAAA (SEQ ID NO: 305) ; 

CTCCCAAAGA (SEQ ID NO: 307) ; 

CCCAAAGAAG (SEQ ID NO: 3 09) ; 

CAAAGAAGAC (SEQ ID NO:311) ; 
AAGAAGACAA (SEQ ID NO: 313) ; 
GAAGACAAGA (SEQ ID NO: 315) ; 
AGACAAGATA {SEQ ID NO: 317) ; 
ACAAGATATC (SEQ ID NO: 319) ; 
AAGATATCCT (SEQ ID NO:321); 
GATATCCTTG (SEQ ID NO:323) ; 
TATCCTTGAT (SEQ ID NO:32S); 
TCCTTGATCT (SEQ ID NO: 327) ; 
CTTGATCTGT (SEQ ID NO: 329) ; 
TGATCTGTGG (SEQ ID NO: 331) ; 
ATCTGTGGAT (SEQ ID NO: 333) ; 
CTGTGGATCT (SEQ ID NO: 335) ; 
6TQ0ATCTAC (SEQ ID NO: 337) ; 
GGATCTACCA (SEQ ID NO: 339) ; 
ATCTACCACA (SEQ ID NO: 341) ; 
CTACCACACA (SEQ ID NO: 343) ; 
ACCACACACA (SEQ ID NO: 34 5) ; 
CACACACAAO (SEQ ID NO: 347) ; 
CACACAAGGC (SEQ ID NO: 34 9) ; 
CACAAGGCTA (SEQ ID NO: 351) ; 
CAAGGCTACT (SEQ ID NO:353); 
AGGCTACTTC (SEQ ID NO: 355) ; 
GCTACTTCCC (SEQ ID NO: 357) ; 
TACTTCCCTG (SEQ ID NO: 359) ; 
CTTCCCTGAT (SEQ ID NO: 361) ; 
TCCCTGATTG (SEQ ID NO: 363) ; 
CCTGATTGGC (SEQ ID NO: 365); 
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CTGATTGGCA (SEQ ID NO: 366) j 
GATTGGCAGA (SEQ ID NO: 368) j 
TTGGCAGAAC (SEQ ID NO:370) ; 
GGCAGAACTA (SEQ ID NO: 372) ; 
CAGAACTACA (SEQ ID NO: 374); 
GAACTACACA (SEQ ID NO: 376); 
ACTACACACC (SEQ ID NO: 378); 
TACACACCAG (SEQ ID NO: 380); 
CACACCAGGG (SEQ ID NO: 382); 
CACCAGGGCC (SEQ ID NO: 384); 
CCAGGGCCAG (SEQ ID NO: 386) ; 
AGGGCCAGGG (SEQ ID NO:388); 
GGCCAGGGGT (SEQ ID NO: 390); 
CCAGGGGTCA (SEQ ID NO: 392); 
AGGGGTCAGA (SEQ ID NO: 394) ; 
GGGTCAGATA (SEQ ID NO:396); 
GTCAGATATC (SEQ ID NO: 398); 
CAGATATCCA (SEQ ID NO: 400) ; 
GATATCCACT (SEQ ID NO: 402) ; 
TATCCACTGA (SEQ ID NO:404) ; 
TCCACTGACC (SEQ ID NO:406) ; 
CACTGACCTT (SEQ ID NO: 408); 
CTGACCTTTG (SEQ ID NO: 410) ; 
GACCTTTGGA (SEQ ID NO: 412) ; 
CCTTTGGATG (SEQ ID NO: 414) ; 
TTTG6ATG6T (SEQ ID NO: 416) ; 
TGGATGGTGC (SEQ ID NO: 418) ; 
GATGGTGCTA- (SEQ ID NO: 420) ; 
TGGTGCTACA (SEQ ID NO: 422) ; 
GTGCTACAAG (SEQ ID NO: 424) ; 
GCTACAA6CT (SEQ ID NO: 426) ; 
TACAAGCTAG (SEQ ID NO:428) ; 
CAAGCTAGTA (SEQ ID NO: 430) ; 
AGCTAGTACC (SEQ ID NO: 43 2) ; 
CTAGTACCAG (SEQ ID NO: 434) ; 
AGTACCA6TT (SEQ ID NO: 43 6) ; 
TACCAGTTGA (SEQ ID NO: 438} ; 
CCAGTTGAGC (SEQ ID NO: 440); 
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TGATTGGCAG 


(SEQ 


ID 


NO:367 


ATTGGCAGAA 


(SEQ 


ID 


NO:369 


TGSCAGAACT 


(SEQ 


ID 


NO: 371 


GCAGAACTAC 


(SEQ 


ID 


NO: 373 


AGAACTACAC 


(SEQ 


ID 


NO: 375 


AACTACACAC 


(SEQ 


ID 


NO: 377 


CTACACACCA 


(SEQ 


ID 


NO: 379 


ACACACCAGG 


(SEQ 


ID 


NO: 381 


ACACCAGGGC 


(SEQ 


ID 


NO: 383 


ACCAGGGCCA 


(SEQ 


ID 


NO: 385 


CAGGGCCAGG 


(SEQ 


ID 


NO:387. 


GGGCCAGGGG 


(SEQ 


ID 


NO: 389! 


GCCAGGGGTC 


(SEQ 


• ID 


N0:391 


GAGGGGTGAG 


(SEQ 


ID 


NO: 393! 


GGGGTCAGAT 


(SEQ 


ID 


NO: 395] 


GGTCAGATAT 


(SEQ 


ID 


NO:397] 


TCAGATATCC 


(SEQ 


ID 


NO: 399] 


AGATATCCAC 


(SEQ 


ID 


N0:401] 


ATATCCACTG 


(SEQ 


ID 


NO: 403) 


ATCCACTGAC 


(SEQ 


ID 


NO: 405) 


CCACTGACCT 


(SEQ 


ID 


NO: 407) 


ACTGACCTTT 

•»W A W*» w W AAA 


(SEQ 


ID 


NO: 409] 


£ W>\»wA X A WW 


(SEO 


ID 


NO: 411] 


ACCTTTGGAT 

AAA WW*%A 


(SEQ 


ID 


NO: 413] 


W JL A A WWJ & A WW 


(SEO 


ID 


NO: 415] 


TTGGATGGTG 

A A A A MB 


(SEQ 


ID 


NO: 417) 


GGATGGTGCT 


(SEQ 


ID 


NO: 419) 


ATGGTGCTAC 


(SEQ 


ID 


NO: 421] 


GGTGCTACAA 


(SEQ 


ID 


NO: 423) 


TGCTACAAGC 


(SEQ 


ID 


NO: 425) 


CTACAAGCTA 


(SEQ 


ID 


NO: 427) 


ACAAGCTAGT 


(SEO 
\ ****** 


ID 


NO: 429) 


AAGCTAGTAC 


(SEQ 


ID 


NO: 431) 


GCTAGTACCA 


(SEQ 


ID 


NO: 433] 


TAGTACCAGT 


(SEQ 


ID 


NO:435] 


GTACCAGTTG 


(SEQ 


ID 


NO:437] 


ACCAGTTGAG 


(SEQ 


ID 


NO: 439] 


CAGTTGAGCC 


(SEQ 


ID 


NO: 441] 
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AGTTGAGCCA 

TTGAGCCAGA 

GAGCCAGATA 

GCCAGATAAG 

CAGATAAGGT 

GATAAGGTAG 

TAAGGTAGAA 

AGGTAGAAGA 

GTAGAAGAGG 

AGAA6AGGCC 

AAGAGGCCAA 

GAGGCCAATA 

GGCCAATAAA 

CCAATAAAGG 

AATAAAGGAG 

TAAAGGAGAG 

AAGQAGAGAA 

GGAGAGAACA 

AGAGAACACC 

AGAACACCAG 

AACACCAGCT 

CACCA6CTTG 

CCAGCTTGTT 

AGCTTGTTAC 

CTTGTTACAC 

TGTTACACCC 

TTACACCCTG 

ACACCCTGTG 

ACCCTGTGAG 

CCTGTGAGCC 

TGTGAGCCTG 

TGAGCCTGCA 

A6CCTGCAT6 

CCTGCATGGA 

TGCATGGAAT 

CATGGAATGG 

TGGAATGGAT 

GAATGGATGA 



(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
{SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 
(SEQ ID NO 



442) 

444) 

446) 

448) 

450) 

452) 

454) 

456) 

458) 

460) 

462) 

464) 

466) 

468) 

470) 

472) ; 

474) ; 

476) ; 

478) ; 

480) 

482) 

484) 

486) 

488) 

490) 

492) / 

494) 

496) 

498) 

500) 

502) 

504) 

506) 

508) 

510) 

512) 

514) 

516) 



GTTGAGCCAG 

TGAGCCAGAT 

AGCCAGATAA 

CCAGATAAGG 

AGATAAGGTA 

ATAAGGTAGA 

AAGGTAGAAG 

GGTAGAAGAG 

TAGAAGAGGC 

GAAGAGGCCA 

AGAGGCCAAT 

AGGCCAATAA 

GCCAATAAAG 

CAATAAAGGA 

ATAAAGGAGA 

AAAGGAGAGA 

AGGAGAGAAC 

GAGAGAACAC 

GAGAACACCA 

GAACACCAGC 

ACACCAGCTT 

ACCAGCTTGT 

CAGCTTGTTA 

GCTTGTTACA 

TTGTTACACC 

GTTACACCCT 

TACACCCTGT 

CACCCTGTGA 

CCCTGTGAGC 

CTGTGAGCCT 

GTGAGCCTGC 

GAGCCTGCAT 

GCCTGCATGG 

CTGCATGGAA 

GCATGGAATG 

ATGGAATGGA 

GGAATGGATG 

AATGGATGAC 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ. ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 443] 
NO: 445] 
NO:447) 
NO:449) 
NO: 451) 
NO:453] 
NO: 455] 
NO:457) 
NO:459] 
NO:461) 
NO:463) 
NO: 465) 
NO:467) 
NO:469] 
NO: 4 71) 
NO:473] 
NO: 475 ; 
NO: 4 77) 
NO: 479] 
NO: 481) 
NO: 483) 
NO: 485) 
NO:487) 
NO: 489] 
NO: 491] 
NO: 493] 
NO:495) 
NO: 497) 
NO:499) 
NO: 5 01) 
NO:503) 
NO: 505) 
NO: 507) 
NO: 509) 
NO: 511) 
NO: 513] 
NO: SIS) 
NO: 517) 
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ATGGATGACC (SEQ ID NO: 518 
GGATGACCCT (SEQ ID NO: 520 
ATGACCCTGA (SEQ JD NO: 522 
OACCCTGAGA (SEQ ID NO: 524 
CCCTGAGAGA (SEQ ID NO: 526, 
CTGAGAGAGA (SEQ ID NO: 528 
GAGAGAGAAG (SEQ ID NO: 530! 
GAGAGAAGTG (SEQ ID NO: 532] 
GAGAAGTGTT (SEQ ID NO: 534] 
OAAGTGTTAG (SEQ ID NO: 536] 
AGTGTTAGAG (SEQ ID NO: 53 8] 
TGTTAGAGTG (SEQ ID NO: 540] 
TTAGAGTGGA (SEQ ID NO: 542] 
AGAGTGGAGG (SEQ ID NO: 544] 
AGTGGAGGTT (SEQ ID NO: 546] 
TGGAGGTTTG (SEQ ID NO: 54 8] 
GAGGTTTGAC (SEQ ID NO: 550] 
GGTTTGACAG (SEQ ID NO: 552] 
TTTGACAGCC (SEQ ID NO: 554] 
TGACAGCCGC (SEQ ID NO: 556] 
ACAGCCGCCT (SEQ ID NO: 558] 
AGCCGCCTAG (SEQ ID NO: 560) 
CCGCCTAGCA (SEQ ID NO: 562) 
GCCTAGCATT (SEQ ID NO: 564) 
CTAGCATTTC (SEQ ID NO: 566) 
AGCATTTCAT (SEQ ID NO: 568) 
CATTTCATCA (SEQ ID NO: 570) 
TTTCATCACG (SEQ ID NO: 572) 
TCATCACGTG (SEQ ID NO: 574) 
ATCACGTGGC (SEQ ID NO: 576) 
CACGTGGCCC (SEQ ID NO: 578) 
CGTGGCCCGA (SEQ ID NO: 580) 
TGGCCCGAGA (SEQ ID NO: 582) 
GCCCGAGAGC (SEQ ID NO: 584) 
CCGAGAGCTG (SEQ ID .NO: 58 6) 
GAGAGCTGCA (SEQ ID NO: 588) 
GAGCTGCATC (SEQ ID NO: 590) 
GCTGCATCCG (SEQ ID NO: 592) 
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; TGGATGACCC (SEQ ID NO: 519) ; 

GATGACCCTG (SEQ ID NO: 521) ; 

TGACCCTGAG (SEQ ID NO: 523) ; 

ACCCTGAGAG (SEQ ID NO: 525) ; 

CCTGAGAGAG (SEQ ID NO:527) ; 

TGAGAGAGAA (SEQ ID NO: 529) ; 
; AGAGAGAAGT (SEQ ID NO: 531) ; 
; AGAGAAGTGT (SEQ ID NO: 533); 
; AGAAGTGTTA (SEQ ID NO: 535) ; 
; AAGTGTTAGA (SEQ ID NO:537) ; 

GTGTTAGAGT (SEQ ID NO: 539) ; 

GTTAGAGTGG (SEQ ID NO: 541) ; 
; TAGAGTGGAG (SEQ ID NO: 543) ; 
; GAGTGGAGGT (SEQ ID NO: 545); 
; GTGGAGGTTT (SEQ ID NO: 547) ; 
; GGAGGTTTGA (SEQ ID NO: 549) ; 
; AGGTTTGACA (SEQ ID NO: 551) ; 
; GTTTGACAGC (SEQ ID NO: 553) ; 
; TTGACAGCCG (SEQ ID NO: 555) ; 
; GACAGCCGCC (SEQ ID NO: 557) ; 

CAGCCGCCTA (SEQ ID NO: 559) ; 

GCCGCCTAGC (SEQ ID NO: 561) ; 

CGCCTAGCAT (SEQ ID NO: 563) ; 
; CCTAGCATTT (SEQ ID NO: 565) ; 

TAGCATTTCA (SEQ ID NO: 567) ; 
; GCATTTCATC (SEQ ID NO:569) ; 
; ATTTCATCAC (SEQ ID NO: 571) ; 
; TTCATCACGT (SEQ ID NO: 573); 
; CATCACGTGG (SEQ ID NO: 575); 
; TCACGTGGCC (SEQ ID NO: 577); 
; ACGTGGCCCG (SEQ ID NO: 579) ; 
; GTGGCCCGAG (SEQ ID NO: 581) ; 
; GGCCCGAGAG (SEQ ID NO: 583) ; 

CCCGAGAGCT (SEQ ID NO:585) ; 
; CGAGAGCTGC (SEQ ID NO: 587) ; 

AGAGCTGGAT (SEQ ID NO:589) ; 

AGCTGCATCC (SEQ ID NO: 591) ; 
; CTGCATCGGG (SEQ ID NO: 593); 
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TGCATCCGGA (SEQ ID NO: 594) ; 

CATCCGGAGT (SEQ ID NO:596); 

TCCGGAGTAC (SEQ ID NO: 598); 

CG6AGTACTT (SEQ ID NO:600); 

GAGTACTTCA (SEQ ID NO: 602); 

GTACTTCAAG (SEQ ID NO: 604); 
ACTTCAAGAA (SEQ ID NO: 606) ; 
TTCAAGAACT (SEQ ID NO: 608); 
CAAGAACTGC (SEQ ID NO: 610); 
AGAACTGCTG (SEQ ID NO: 612) ; 



GCATCCGGAG (SEQ 
ATCCGGAGTA (SEQ 
CCGGAGTACT (SEQ 
GGAGTACTTC (SEQ 
AGTACTTCAA (SEQ 
TACTTCAAGA (SEQ 
CTTCAAGAAC (SEQ 
TCAAGAACTG (SEQ 
AAGAACTGCT (SEQ 
GAACTGCTGA (SEQ 



ID NO: 595) ; 
ID NO: 597) ; 
ID NO: 599) ; 
ID NO: 601) ; 
ID NO: 603) ; 
ID NO: 605) ; 
ID NO: 607) ; 
ID NO: 609) ; 
ID NO: 611) ; 
ID NO: 613) . 



14. A strain of HIV- 1 according to claim 8 wherein said deletion encompasses at 
least one of the following decanocleotides from the LTR region of HTV-1 strain NL4-3: 



GCTTTTTGCC 

TTTTTGCCTG 

TTTGCCTGTA 

TGCCTGTACT 

CCTGTACTGG 

TGTACTGGGT 

TACTGGGTCT 

CTGGGTCTCT 

GGGTCTCTCT 

GTCTCTCTGG 

CTCTCTGGTT 

CTCTGGTTAG 

CTGGTTAGAC 

GGTTAGACCA 

TTAGACCAGA 

AGACCAGATC 

ACCAGATCTG 

CAGATCTGAG 

GATCTGAGCC 

TCTGAGCCTG 

TGAGCCTGGG 

AGCCTGGGAG 

CCTGGGAGCT 

TGGGAGCTCT 



(SEQ ID NO: 652) ; 

(SEQ ID NO: 654) ; 

(SEQ ID NO: 656) ; 

(SEQ ID NO: 658) ; 

(SEQ ID NO: 660) ; 

(SEQ ID NO: 662) ; 

{SEQ ID NO: 664) ; 

(SEQ ID NO: 666) ; 

(SEQ ID NO: 668) ; 

(SEQ ID NO: 670) ; 
(SEQ ID NO: 672) ; 
(SEQ ID NO: 674) ; 
(SEQ ID NO: 676) ; 
(SEQ ID NO: 678) ; 
(SEQ ID NO: 680) ; 
(SEQ ID NO: 682) ; 
(SEQ ID NO: 684) ; 
(SEQ ID NO: 686) ; 
(SEQ ID NO: 688) ; 
(SEQ ID NO: 690) ; 
(SEQ ID NO: 692) ; 
(SEQ ID NO: 694) ; 
(SEQ ID NO:696); 
(SEQ ID NO: 698) ; 



CTTTTTGCCT 


(SEQ 


ID NO: 653) ; 


TTTTGCCTGT 


(SEQ 


ID NO: 655) ; 


TTGCCTGTAC 


(SEQ 


ID NO: 657) ; 


GCCTGTACTG 


(SEQ 


ID NO: 659) ; 


CTGTACTGGG 


(SEQ 


ID NO: 661) ; 


GTACTGGGTC 


(SEQ 


ID NO: 663) ; 


ACTGGGTCTC 


(SEQ 


ID NO: 665) ; 


TGGGTCTCTC 


(SEQ 


ID NO: 667) ; 


GGTCTCTCTG 


(SEQ 


ID NO: 669) ; 


TCTCTCTGGT 


(SEQ 


ID NO: 671) ; 


TCTCTGGTTA 


(SEQ 


ID NO: 673) ; 


TCTCTGGTTA 


(SEQ 


ID NO: 675) ; 


TGGTTAGACC 


(SEQ 


ID NO: 677) ; 


GTTAGACCAG 


(SEQ 


ID NO: 679) ; 


TAGACCAGAT 


(SEQ 


ID. NO: 681) ; 


GACCAGATCT 


(SEQ 


ID NO: 683) ; 


CCAGATCTGA 


(SEQ 


ID NO: 685) ; 


AGATCTGAGC 


(SEQ 


ID NO: 687) ; 


ATCTGAGCCT 


(SEQ 


ID NO: 689) ; 


CTGAGCCTGG 


(SEQ 


ID NO: 691) ; 


GAGCCTGGGA 


(SEQ 


ID NO: 693) ; 


GCCTGGGAGC 


(SEQ 


ID NO:695); 


CTGGGAGCTC 


(SEQ 


ID NO:697) ; 


GGGAGCTCTC 


(SEQ 


ID NO: 699) ; 
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GGAGCTCTCT 

AGCTCTCTGG 

CTCTCTGQCT 

CTCTGGCTAA 

CTGGCTAACT 

GGCTAACTAG 

CTAACTAGGG 

AACTAGGGAA 

CTAGGGAACC 

AGGGAACCCA 

GGAACCCACT 

AACCCACTGC 

CCCACTGCTT 

CACTGCTTAA 

CTGCTTAAGC 

GCTTAAGCCT 

TTAAGCCTCA 

AAGCCTCAAT 

GCCTCAATAA 

CTCAATAAAG 

CAATAAAGCT 

ATAAAGCTTG 

AAAGCTTGCC 

AGCTTGCCTT 

CTTGCCTTGA 

TGCCTTGAGT 

CCTTGAGTGC 

TTGAGTGCTT 

GAGTGCTTCA 

GTGCTTCAAG 

GCTTCAAGTA 

TTCAAGTAGT 

CAAGTAGTGT 

AGTAGTGTGT 

TAGTGTGTGC 

GTGTGTGCCC 

GTGTGCCCGT 

GTGCCCGTCT 



(SEQ ID NO: 700) ; 
(SEQ ID NO: 702} ; 
(SEQ ID NO: 7 04) ; 
(SEQ ID NO: 706); 
(SEQ ID NO: 708) ; 
(SEQ ID NO: 710) ; 
(SEQ ID NO: 712) ; 
(SEQ ID NO: 714) ; 
(SEQ ID NO: 716) ; 
(SEQ ID NO: 718) ; 
(SEQ ID NO: 720) ; 
(SEQ ID NO: 722); 
(SEQ ID NO: 724} ; 
(SEQ ID NO: 726) ; 
(SEQ ID NO: 728) ; 
(SEQ ID NO: 730) ; 
(SEQ ID NO: 732) ; 
(SEQ ID NO: 734) ; 
(SEQ ID NO: 736) ; 
(SEQ ID NO: 738) ; 
(SEQ ID NO: 740) ; 
(SEQ ID NO: 742) ; 
(SEQ ID NO: 744) ; 
(SEQ ID NO: 746) ; 
(SEQ ID NO: 74 8) ; 
(SEQ ID NO: 750) ; 
(SEQ ID NO: 752) ; 
(SEQ ID NO: 754) ; 
(SEQ ID NO: 756) ; 
(SEQ ID NO: 758) ; 
{SEQ ID NO: 760) ; 
(SEQ ID NO: 762) ; 
(SEQ ID NO: 764) ; 
(SEQ ID NO: 766) ; 
(SEQ ID NO: 768} ; 
(SEQ ID NO: 770) ; 
(SEQ ID NO: 772) ; 
(SEQ ID NO: 774) ; 



GAGCTCTCTG 

GCTCTCTGGC 

TCTCTGGCTA 

TCTGGCTAAC 

TGGCTAACTA 

GCTAACTAGG 

TAACTAGGGA 

ACTAGGGAAC 

TAGGGAACCC 

GGGAACCCAC 

GAACCCACTG 

ACCCACTGCT 

CCACTGCTTA 

ACTGCTTAAG 

TGCTTAAGCC 

CTTAAGCCTC 

TAAGCCTCAA 

AGCCTCAATA 

CCTCAATAAA 

TCAATAAAGC 

AATAAAGCTT 

TAAAGCTTGC 

AAGCTTGCCT 

GCTTGCCTTG 

TTGCCTTGAG 

GCCTTGAGTG 

CTTGAGTGCT 

TGAGTGCTTC 

AGTGCTTCAA 

TGCTTCAAGT 

CTTCAAGTAG 

TCAAGTAGTG 

AAGTAGTGTG 

GTAGTGTGTG 

AGTGTGTGCC 

TGTGTGCCCG 

TGTGCCCGTC 

TGCCOGTCTG 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ZD 
(SEQ ID 
(SEQ ID 
(SEQ ID. 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
{SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 701) 
NO: 703] 
NO: 705] 
NO: 707) 
NO: 709) 
NO:711) 
NO:713) 
NO: 715) 
NO : 717) 
NO: 719] 
NO: 721) 
NO: 723] 
NO: 725) 
NO: 727) 
NO: 729) 
NO:731) 
NO: 733) 
NO:735) 
NO:737) 
NO:739) 
NO: 741) 
NO: 743) 
NO:745] 
NO:747) 
NO: 749} 
NO: 751) 
NO: 753] 
NO: 755) 
NO:757) 
NO: 759) 
N0:761) 
NO: 763] 
NO: 765) 
NO:767) 
NO: 769) 
N0:771) 
NO: 773] 
NO:775) 
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GCCCGTCTGT 


(SEQ 


ID 


NO: 776) ; 


CCGTCTGTTG 


(SEQ 


10 


NO: 778) ; 


GTCTGTTGTG 


(SEQ 


ID 


NO: 780) ; 


CTGTTGTGTQ 


(SEQ 


ID 


NO: 782) ; 


GTTGTGTGAC 




ID 


NO: 784} ; 


TGTGTGACTC 


(SEO 


ID 




TGTGTGACTC 


(SEQ 


ID 


NO:788) ; 


TGTGACTCTG 


(SEQ 


ID 


NO:790) ; 


TGACTCTGGT 


(SEQ 


ID 


NO:792) ; 


ACTCTGGTAA 


{SEQ 


ID 


»0:794) ; 


TCTGGTAACT 


(SEQ 


ID 


NO:796) ; 


TGGTAACTAG 


(SEQ 


ID 


NO:798) ; 



CCCOTCTGTT 


(SEQ 


ID 


NO: 777) ; 


CGTCTGTTGT 


(SEQ 


ID 


NO: 779) ; 


TCTGTTGTGT 


(SEQ 


ID 


NO: 781) ; 


TGTTGTGTGA 


(SEQ 


ID 


NO: 783) ; 


TTGTGTGACT 


(SEQ 


ID 


NO: 785) ; 


GTGTGACTCT 


(SEO 


ID 




GTGTGACTCT 


(SEQ 


ID 


NO:789) ; 


GTGACTCTGG 


(SEQ 


ID 


NO: 791) ; 


GACTCTGGTA 


(SEQ 


ID 


NO: 793) ; 


CTCTGGTAAC 


(SEQ 


ID 


NO:795) ; 


CTGGTAACTA 


(SEQ 


ID 


NO: 797) ; 


GGTAACTAGA 


(SEQ 


ID 


NO:799) . 



15. A strain of HIV- 1 according to claim 8 having the identifying characteristics of 
HIV-1 isolate ECACC V9410I706 or HIV-1 isolate ECACC V94103 1 169 or isolate C54 
deposited at ECACC on 14 February, 1995 under Accession Number . 

16. An isolated strain of HIV-1 which is reactive to antibodies to a glycoprotein of 
HIV-1, is capable of inducing an immune response to at least one of gag, pot and/or env 
and which is incapable of directing synthesis of a nef gene product or a full length nef 
gene product 

17. An isolated strain of HIV- 1 according to claim 16 wherein the glycoprotein is 
at least one of gp4M5, gpI20 and/or gpl60. 

18. A nucleic acid molecule or apart, fragment or derivative thereof from an HIV-1 
isolate defined in any one of claims 1 to 16. 

19. A molecular infectious clone comprising a nucleic acid molecule according to 
claim 18. 

20. A method for inhibiting or reducing productive infection of an individual by a 
pathogenic strain of HIV-1, said method comprising administering to a subject a non- 
pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate 
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target cells carrying DNA derived from said non-pathogenic HTV-1. 



21. A method according to claim 20 wherein said isolate is capable of stimulating 
an immune response in a human or primate against at least one glycoprotein on HIV-1 
whilst not substantially reducing proliferative responses and cytokine production to a 
mitogen in said human or primate compared to a healthy, non-infected human or primate 
subject 

22. A method according to claim 21 wherein said isolate carries one or more 
mutations in its genome resulting in the inability to direct synthesis of at least one 
polypeptide or protein from a pathogenic strain of 

23. A method according to claim 17 wherein said isolate carries one or more 
mutations in its genome resulting in said genome directing synthesis of a truncated form 
of a polypeptide or protein from a pathogenic strain of HTV-1. 

24. A method according to claim 22 or 23 wherein said HTV-1 isolate carries a 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HTV-1 genome. 

25. A method according to claim 24 wherein said HIV-1 isolate is reactive to 
antibodies to gp41-45, gpl20 and/or gpl60 of HIV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nef gent and/or LTR region 
of pathogenic HIV-1 . 

26. A method according to claim 25 wherein said HTV-1 isolate is capable of 
inducing an immune response to at least one of gag f pol and/or em. 

27. A method according to claim 26 wherein said HIV-1 isolate carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 8787 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 
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28. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (I) 8830-8862; 

(0) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 

29. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected tram the list consisting of: 
nucleotide (v) 9281-9371; 

(v0 9281-9362; 

(vii) 9105-9224; and 

(viii) 9271-9370. 

30. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide 0x) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 

(xii) 9112-9204. 

31. A method according to claim 27 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

32. A method according to claim 27 wherein said deletion encompasses at least one 
of the following decamiclcotides from the nef gene of HTV-1 strain NL43: 



ATGGGTGGCA (SEQ ID HO: 2) 
GGGTGGCAAG (SEQ ID 170:4} 
GTGGCAAGTG (SEQ ID NO: 6) 
GGCAAGTG6T (SEQ ID NO: 8) 



TGGGTGGCAA (SEQ ID NO: 3) 

GGTGGCAAGT (SEQ ID NO: 5) 

TGGCAA6TGG (SEQ ID NO: 7) 

GCAAGTGGTC (SEQ ID NO: 9) 
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CAAGTGGTCA 


(SEQ ID NO: 10] 


i 


AAGTGGTCAA 


(SEQ 


ID 


NO: 11) ; 


AGTGGTCAAA 


(SEQ 


ID 


NO: 12] 


f ; 


GTGGTCAAAA 


(SEQ 


ID 


NO:13) / 


TGGTCAAAAA 


(SEQ 


ID 


NO: 14; 




GGTCAAAAAG 


(SEQ 


ID 


NO: 15) ; 


GTCAAAAAGT 


(SEQ 


ID 


NO: 16) 




TCAAAAAGTA 


(SEQ 


ID 


NO: 17); 


CAAAAAGTAG 


(SEQ ID NO: 18) 




AAAAAGTAGT 


(SEQ ID NO: 19) ; 


AAAAGTAGTG 


(SEQ 


ID 


NO: 20) 


• 


AAAGTAGTGT 


(SEQ 


ID 


NO: 21) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO:22) 




AGTAGTGTGA 


(SEQ 


ID 


NO:23) ; 


GTAGTGTGAT 


(SEQ ID NO: 24) 


• 


TAGTGTGATT 


(SEQ ID NO:25) ; 


AGTGTGATTG 


(SEQ 


ID 


NO:26) 


• 


GTGTGATTGG 


(SEQ 


ID 


NO.-27) ; 


TGTGATTGGA 


(SEQ 


ID 


NO:28) 




GTGATTGGAT 


(SEQ 


ID 


NO:29) ; 


TGATTGGATG 


(SEQ 


ID 


NO:30) 




GATTGGATGG 


(SEQ 


ID 


NO:31) ; 


ATTGGATGGC 


(SEQ 


ID NO: 32) 




TTGGATGGCC 


(SEQ 


ID 


NO:33) f 


TGGATGGCCT 


(SEQ ID NO: 34) 


m 


GGATGGCCTG 


(SEQ 


ID 


NO: 35); 


GATGGCCTGC 


(SEQ 


ID 


NO: 36) 


m 


ATGGCCTGCT 


(SEQ 


ID 


NO: 37) ; 


TGGCCTGCT6 


(SEQ 


ID 


NO: 38) 




GGCCTGCTGT 


(SEQ 


ID 


NO:39) ; 


GCCTGCTGTA 


(SEQ 


ID 


NO: 40) 


i 


CCTGCTGTAA 


(SEQ 


ID 


NO:41) ; 


CTGCTGTAAG 


(SEQ 


ID 


NO: 42) 


/ 


TGCTGTAAGG 


(SEQ 


ID 


NO:43) ; 


GCTGTAAGGG 


(SEQ 


ID 


NO: 44) 




CTGTAAGGGA 


(SEQ 


ID 


NO: 45) ; 


TGTAAGGGAA 


(SEQ 


ID 


NO: 46) 




GTAAGGGAAA 


(SEQ 


ID 


NO:47) ; 


TAAGGGAAAG 


(SEQ 


ID 


NO: 48) 




AAGGGAAAGA 


(SEQ 


ID 


NO:49) ; 


AGGGAAAGAA 


(SEQ 


ID 


NO: 50) 




GGGAAAGAAT 


(SEQ 


ID 


NO: 51) ; 


GGAAAGAATG 


(SEQ 


ID 


NO: 52) 




GAAAGAATGA 


(SEQ 


ID 


NO:53) ; 


AAAGAATGAG 


(SEQ 


ID 


NO: 54) 




AAGAATGAGA 


(SEQ 


ID 


NO:55) ; 


AGAATGAGAC 


(SEQ 


ID 


NO: 56) 




GAATGAGACG 


(SEO 


ID 


NO: 57) ; 


AATGAGACGA 


(SEQ 


ID 


NO: 58) 


* 


ATGAGACGAG 


(SEQ 


ID 


NO:59) ; 


TGAGACGAGC 


(SEQ 


ID 


NO: 60) 


9 


GAGACGAGCT 


(SEQ 


ID 


NO: 61) ; 


AQAG6A6CTG 


(SEQ 


ID 


NO: 62) 


• 
t 


GACGAGCTGA 


(SEQ 


ID 


NO: 63) ; 


ACGAGCTGAG 


(SEQ 


ID 


NO: 64) 


; 


CGAGCTGAGC 


(SEQ 


ID 


NO: 65) ; 


GAGCTGAGCC 


(SEQ 


ID 


NO: 66) 


; 


AGCTGAGCCA 


(SEQ 


ID 


NO: 67) ; 


GCTGAGCCAG 


(SEQ 


ID 


NO: 68) 


; 


CTGAGCCAGC 


(SEQ 


ID 


NO: 69) ; 


TGAGCCAGCA 


(SEQ 


ID 


NO: 70) 




GAGCCAGCAG 


(SEQ 


ID 


NO: 71) ; 


AGCCAGCAGC 


(SEQ ID NO: 72) 




GCCAGCAGCA 


(SEQ 


ID 


NO:73) ; 


CCA6CA6CAG 


(SEQ 


ID 


NO: 74) 




CAGCAGCAGA 


(SEQ 


ID 


NO: 75) ; 


AGCAGCAGAT 


(SEQ 


ID NO: 76} 




GCAGCAGATG 


(SEQ 


ID 


NO: 77) ; 


CAGCAGATGG 


(SEQ 


ID 


NO: 78) 




AGCAGATGGG 


(SEQ 


ID 


NO: 79); 


GCAGATGGGG 


(SEQ 


ID 


NO: 80) 




CAGATGGGGT 


(SEQ 


ID 


NO: 81) ; 


AGATGGGGTG 


(SEQ 


ID 


NO: 82) 




GATGGGGTGG 


(SEQ 


ID 


NO:83) ; 


ATGGGGTGGG 


(SEQ 


ID 


NO: 84) 




TGGGGTGGGA 


(SEQ 


ID 


NO:85); 
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GGGGTGGGAG 
GGTGGGAGCA 
TGGGAGCAGT 
GGAGCAGTAT 
AGCAGTATCT 
CAGTATCTCG 
GTATCTCGAG 
ATCTCGAGAC 
CTCGAGACCT 
CGAGACCTAG 
AGACCTAGAA 
ACCTAGAAAA 
CTAGAAAAAC 
AGAAAAACAT 
AAAAACATGG 
AAACATGGAG 
ACATGGAGCA 
ATGGAGCAAT 
GGAGCAATCA 
AGCAATCACA 
CAATCACAAG 
ATCACAAGTA 
CACAAGTAGC 
CAAGTAGCAA 
AGTAGCAATA 
TAGCAATACA 
GCAATACAGC 
AATACAGCAG 
TACAGCAGCT 
CAGGAGCTAA 
GCAGCTAACA 
AGCTAACAAT 
CTAACAATGC 
AACAATGCTG 
CAATGCTGCT 
ATGCTGCTTG 
GCTGCTTGTO 
TGCTTGTGCC 



(SEQ ID 
{SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 86) 
NO: 88) 
NO: 90) 
NO: 92) 
NO: 94) 
NO: 96) 
NO: 98) 
NO: 100) 
NO: 102) 
NO: 104) 
NO: 106) 
NO: 108) 
NO: 110) 
NO: 112) 
NO: 114) ; 
NO: 116) 
NO: 118) 
NO: 120) 
NO: 122) 
NO: 124) ; 
NO: 126) 
NO: 128) 
NO: 130) 
NO: 132) 
NO: 134) ; 
NO: 136) 
NO:138) 
NO: 140) 
NO: 142) ; 
NO: 144) ; 
NO: 14 6) ; 
NO: 148); 
NO: 150) ; 
NO: 152) ; 
NO: 154) ; 
NO: 156) ; 
NO: 158); 
NO: 160) ; 



6G6TG6GA6C 
GTGGGAGCAG 
GGGAGCAGTA 
GAGCAGTATC 
GCAGTATCTC 
AGTATCTCGA 
TATCTCGAGA 
TCTCGAGACC 
TCGAGACCTA 
GAGACCTAGA 
GACCTAGAAA 
CCTAGAAAAA 
TAGAAAAACA 
GAAAAACATG 
AAAACATGGA 
AACATGGAGC 
CATGGAGCAA 
TGGAGCAATC 
GAGCAATCAC 
GCAATCACAA 
AATCACAAGT 
TCACAAGTAG 
ACAAGTAGCA 
AAGTAGCAAT 
GTAGCAATAC 
AGCAATACAG 
CAATACAGCA 
ATACAGCAGC 
ACAGCAGCTA 
AGCAGCTAAC 
CAGCTAACAA 
GCTAACAATG 
TAACAATGCT 
ACAAT6CT6C 
AATGCTGCTT 
TGCTGCTTGT 
CTGCTTGTGC 
GCTTGTGCCT 
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NO:87) 


ID 


NO: 89) 


ID 


NO: 91) 


ID 


NO: 93) 


ID 


NO: 95) 
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NO: 97) 


ID 


NO:99) 


ID 


NO: 101) 



(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 
(SEQ 

(SEQ ID NO: 103) ; 
(SEQ ID NO: 105) ; 
(SEQ ID NO: 107) ; 
(SEQ ID NO: 109) ; 
(SEQ ID NO: 111); 
(SEQ ID NO: 113) ; 
(SEQ ID NO: 115) ; 
(SEQ ID NO: 117) ; 
(SEQ ID NO: 119) ; 
(SEQ ID NO: 121) ; 
(SEQ ID NO: 123) ; 
(SEQ ID NO: 125) ; 
(SEQ ID NO: 127) ; 
(SEQ ID NO: 129) / 
(SEQ ID NO: 131) ; 
(SEQ ID NO: 133) ; 
(SEQ ID NO:135) ; 
(SEQ ID NO: 137) ; 
(SEQ ID NO: 139) ; 
(SEQ ID NO:141) ; 
(SEQ ID NO: 143) ; 
(SEQ ID NO: 145) ; 
(SEQ ID NO: 147) ; 
(SEQ ID NO: 149) ; 
(SEQ ID NO: 151) ; 
(SEQ ID NO: 153) ; 
(SEQ ID NO: 155) ; 
(SEQ ID NO: 15 7) ; 
(SEQ ID NO: 159) ; 
(SEQ ID NO: 161) ; 
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CTTGTGCCTG (SEQ ID NO: 162); 
TGTGCCTGGC (SEQ ID NO: 164) ; 
TGCCTGGCTA (SEQ ID NO: 166) ; 
CCTGGCTAGA (SEQ ID NO: 168); 
TGGCTAGAAG (SEQ ID NO: 170); 
GCTAGAAGCA (SEQ ID NO:172); 
TAGAAGCACA (SEQ ID NO: 174) ; 
GAAGCACAAG (SEQ ID NO: 176) ; 
AGCACAAGAG (SEQ ID NO: 178) ; 
CACAAGAGGA (SEQ ID NO: 180) ; 
CAAGAGGAGG (SEQ ID NO:182); 
AGAGGAGGAA (SEQ ID NO: 184); 
AGGAGGAAGA (SEQ ID NO:186) ; 
GAGGAAGAGG (SEQ ID NO: 188) ; 
GGAAGAGGTG (SEQ ID NO: 190) ; 
AAGAGGTGGG (SEQ ID NO: 192) ; 
GAGGTGGGTT (SEQ ID NO: 194) ; 
GGTGGGTTTT (SEQ ID NO: 196) ; 
TGGGTTTTCC (SEQ ID NO: 198) ; 
GGTTTTCCAG (SEQ ID NO: 200) ; 
TTTTCCAGTC (SEQ ID NO:202) ; 
TTCCAGTCAC (SEQ ID NO: 204) ; 
CCAGTCACAC (SEQ ID NO:206) ; 
AGTCACACCT (SEQ ID NO: 208) ; 
TCACACCTCA (SEQ ID NO: 210) ; 
ACACCTCAGG (SEQ ID NO: 2 12); 
ACCTCAGGTA (SEQ ID NO: 214) ; 
CTCAGGTACC (SEQ ID NO:216) ; 
CAGGTACCTT (SEQ ID NO:21B) ; 
GGTACCTTTA (SEQ ID NO: 220) ; 
TACCTTTAAG (SEQ ID NO: 222) ; 
CCTTTAAGAC (SEQ ID NO:224) ; 
TTTAAGACCA (SEQ ID NO: 226) ; 
TAAGACCAAT (SEQ ID NO: 228) ; 
AGACCAATGA (SEQ ID NO: 230); 
ACCAATGACT : (SEQ ID NO: 232) ; 
CAATGACTTA (SEQ ID NO: 234) ; 
ATGACTTACA .{SEQ ID NO: 236) ; 
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CAGTCACACC 


(SEQ 


ID 


NO:207/ ; 
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(SEQ 


ID 


NQ:209) ; 


CACACCTCAG 


(SEQ 


ID 


NO:211) ; 


CACCTCAGGT 


isjeuq 


1U 


£il/:2X3; ; 
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XJJ 




AVJW A W All 


(SEQ 


ID 


NO:219) ; 


GT A CCTTT AA 


(SEQ 


ID 


NO:221) ; 


ACPTTTAAGA 


(SEQ 


ID 


NO:223) ; 


CTTTAAGACC 


(SEQ 


11; 


MU:^4S3| ; 


TTAAGACCAA 


(SEQ 


ID 


NO: 227) ; 


AAGACCAATG 


(SEQ 


ID 


NO: 229) ; 


GACCAATGAC 


(SEQ 


ID 


NO: 231) ; 


CCAATGACTT 


(SEQ 


ID 


NO:233) ; 


AATGACTTAC 


(SEQ 


ID 


NO: 235) ; 


TGACTTACAA 


(SEQ 


ID 


NO: 237) ; 
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GACTTACAAG (SEQ ID NO: 23 8) ; 

CTTACAAGGC (SEQ ID NO: 240) ; 

TACAAGGCAG (SEQ ID NO: 242) ; 

CAAG0CA6CT (SEQ ID NO: 244); 

A6GCAGCTGT (SEQ ID NO: 246) ; 

GGAGCTGTAG (SEQ ID NO: 248); 

AGCTGTAGAT (SEQ ID NO: 250) ; 

CTGTAGATCT (SEQ ID NO: 252) ; 

GTAGATCTTA (SEQ ID NO: 2 54) ; 

AGATCTTAGC (SEQ ID NO: 256); 

ATCTTASCCA (SEQ ID NO:258); 

CTTAGCCACT (SEQ ID NO: 260); 

TAGCCACTTT (SEQ ID NO: 262) ; 

GCCACTTTTT (SEQ ID NO: 264) ; 

CACTTTTTAA (SEQ ID NO: 266} ; 

CTTTTTAAAA (SEQ ID NO: 268); 

TTTTAAAAGA (SEQ ID NO: 270) ; 

TTAAAAGAAA (SEQ ID NO: 272) ; 
AAAAGAAAAG (SEQ ID NO: 274); 
AAGAAAAGGG (SEQ ID NO: 276) ; 
GAAAAGGGGG (SEQ ID NO: 278) ; 
AAAGGGGGGA (SEQ ID NO: 280) ; 
AGGGGGGACT (SEQ ID NO: 282} ; 
GGGGGACTGG (SEQ ID NO: 284) ; 
GGGACTGGAA (SEQ ID NO: 286} ; 
GACTGGAAGG (SEQ ID NO: 288) ; 
CTGGAAGGGC (SEQ ID NO: 290) ; 
GGAAGGGCTA (SEQ ID NO: 292) ; 
AAGGGCTAAT (SEQ ID NO:294) ; 
GGGCTAATTC (SEQ ID NO:296) ; 
GCTAATTCAC (SEQ ID NO: 298) ; 
TAATTCACTC (SEQ ID NO:300) ; 
ATTCACTCCC (SEQ ID NO: 302) ; 
TCACTCCCAA (SEQ ID NO: 3 04); 
ACTCCCAAAG (SEQ ID NO:306) ; 
TCCCAAAGAA (SEQ ID NO: 308) ; 
CCAAAGAAGA (SEQ ID NO: 310); 
AAAGAAGACA (SEQ ID NO:312) ; 
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ACTTACAAGG (SEQ ID NO: 23 9) 
TTACAAGGCA (SEQ ID NO: 241) 
ACAAGGCAGC (SEQ ID NO: 243) 
AAGGCAGCTG (SEQ ID NO: 245), 
GGCAGCTGTA (SEQ ID NO: 247); 
GAGCTGTAGA (SEQ ID NO: 249) ; 
GCTGTAGATC (SEQ ID NO: 251), 
TGTAGATCTT (SEQ ID NO: 253), 
TAGATCTTAG (SEQ ID NO: 255) j 
GATCTTAGCC (SEQ ID NO: 257) ; 
TCTTAGCCAC (SEQ ID NO: 259) ; 
TTAGCCACTT (SEQ ID NO: 261) ; 
AGCCACTTTT (SEQ ID NO: 263) ; 
CCACTTTTTA (SEQ ID NO: 265) ; 
ACTTTTTAAA (SEQ ID NO: 267) ; 
TTTTTAAAAG (SEQ ID NO: 269) ; 

TTTAAAAGAA (SEQ ID NO:271) ; 

TAAAAGAAAA (SEQ ID NO: 273) ; 

AAAGAAAAGG (SEQ ID NO: 275) ; 

AGAAAAGGGG (SEQ ID NO:277); 

AAAAGGGGGG (SEQ ID NO: 2 79) ; 

AAGGGGGGAC (SEQ ID NO: 2 81) ; 

GGGGGGACTG (SEQ ID NO: 283) ; 

GGGGACTGGA (SEQ ID NO: 285) ; 

GGACTGGAAG (SEQ ID NO:287) ; 

ACTGGAAGGG (SEQ ID NO:289); 

TGGAAGGGCT (SEQ ID NO:291); 

GAAGGGCTAA (SEQ ID NO: 2 93) ; 

AGGGCTAATT (SEQ ID NO: 295) ; 

GGCTAATTCA (SEQ ID NO: 297) ; 

CTAATTCACT (SEQ .ID NO: 299) ; 

AATTCACTCC (SEQ ID NO: 301) ; 

TTCACTCCCA (SEQ ID NO: 3 03) ; 

CACTCCCAAA (SEQ ID NO:305) ; 

CTCCCAAAGA (SEQ ID NO: 3 07) ; 

CCCAAAGAAG (SEQ ID NO:309); 

CAAAGAAGAC (SEQ ID NO:311) ; 

AAGAAGACAA (SEQ ID NO:313) ; 
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AGAAGACAAG (SEQ ID NO: 314); 
AAGACAAGAT (SEQ ZD NO:316) ; 
GACAAGATAT (SEQ ID NO: 318) ; 
CAAGATATCC (SEQ ID NO: 320) ; 
AGATATCCTT (SEQ ID NO: 322); 
ATATCCTTGA (SEQ ID NO: 324) ; 
ATCCTTGATC (SEQ ID NO: 326); 
CCTTGATCTG (SEQ ID NO: 328) ; 
TTGATCTGTG (SEQ ID NO: 330) ; 
GATCTGTGGA (SEQ ID NO: 332) ; 
TCTGTGGATC (SEQ ID NO: 334) ; 
TGTGGATCTA (SEQ ID NO: 336) ; 
TGGATCTACC (SEQ ID NO: 338) ; 
GATCTACCAC (SEQ ID NO: 340) ; 
TCTACCACAC (SEQ ID NO: 342) ; 
TACCACACAC (SEQ ID NO: 344) ; 
CCACACACAA (SEQ ID NO: 346) ; 
ACACACAAGG (SEQ ID NO: 348) ; 
ACACAAGGCT (SEQ ID NO: 350) ; 
ACAAGGCTAC (SEQ ID NO: 352) ; 
AAGGCTACTT (SEQ ID NO: 354) ; 
66CTACTTCC (SEQ ID NO: 356) ; 
CTACTTCCCT (SEQ ID NO: 358) ; 
ACTTCCCTGA (SEQ ID NO:360) ; 
TTCCCTGATT (SEQ ID NO: 362) ; 
CCCTGATTGG (SEQ ID NO: 3 64); 
CTGATTGGCA (SEQ ID NO:366); 
GATTGGCAGA (SEQ ID NO:368) / 
TTGGCAGAAC (SEQ ID NO: 370) ; 
GGCAGAACTA (SEQ ID NO: 372); 
CAGAACTACA (SEQ ID NO: 374); 
GAACTACACA (SEQ ID NO: 376) ; 
ACTACACACC (SEQ ID NO: 378) ; 
TACACACCAG (SEQ ID NO: 380) ; 
CACACCAGGG (SEQ ID NO: 3 82) ; 
CACCAGGGCC (SEQ ID NO:384); 
CCAGGGCCAG (SEQ ID NO:386) ; 
AGGGCCAGGG (SEQ ID NO:38B) ; 



GAAGACAAGA 


(SEQ 


ID 


NO:315) , 


AGACAAGATA 


(SEQ 


ID 


N0:317) ; 


ACAAGATATC 


(SEO 


ID 


NO: 319) ; 


AAGATATCCT 


(SEQ 


ID 


NO*321) • 


GATATCCTTG 


(SEQ 


ID 


NO:323) - 


TATCCTTGAT 


(SEQ 


ID 


NO:325) ; 


TCCTTGATCT 


(SEQ 


ID 


NO:327) ; 


CTTGATCTGT 


(SEQ 


ID 


NO:329) ; 


TGATCTGTGG 
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(SEQ 


ID 
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CTACACACCA 


(SEQ 


ID 


NO: 379) ; 


ACACACCAGG 


(SEQ 


ID 


NO:381) ; 


ACACCAGGGC 


(SEQ 


ID 


NO:383) ; 


ACCAGGGCCA 


(SEQ 


ID 


NO:385) ; 


CAGG6CCAGG 


(SEQ 


ID 


NO:387) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO: 389) ; 
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GGCCAGGGGT (SEQ ID NO.-390) ; 
CCAGGGGTCA (SEQ ID NO:392) ; 

AGGGGTCAGA (SEQ ID NO: 394) ; 

GGGTCAGATA (SEQ ID NO: 396) ; 

GTCAGATATC (SEQ ID NO: 398) ; 

CAGATATCCA (SEQ ID NO.-400) ; 

GATATCCACT (SEQ ID NO: 4 02); 

TATCCACTGA (SEQ ID NO: 404); 

TCCACTGACC (SEQ ID NO: 406); 

CACTGACCTT (SEQ ID NO: 408) ; 

CTGACCTTTG (SEQ ID NO: 410); 

GACCTTTGGA (SEQ ID NO: 412) ; 

CCTTTGGATG (SEQ ID NO: 414) ; 

TTTGGATGGT (SEQ ID NO: 416) ; 

TGGATGGTGC (SEQ ID NO: 418) ; 

GATGGTGCTA (SEQ ID NO: 420) ; 

TGGTGCTACA (SEQ ID NO:422) ; 

GTGCTACAAG (SEQ ID NO: 424} ; 

GCTACAAGCT (SEQ ID NO: 42 6) ; 

TACAAGCTAG (SEQ ID NO: 42 8) ; 

CAAGCTAGTA (SEQ ID NO:430) ; 
AGCTAGTACC (SEQ ID NO:432) ; 

CTAGTACGAG (SEQ ID NO: 434) ; 
AGTACCAGTT (SEQ ID NO: 43 6) ; 

TACCAGTTGA (SEQ ID NO:438) ; 

CCAGTTGAGC (SEQ ID NO: 440) ; 
AGTTGAGCCA (SEQ ID NO: 442) ; 
TTGAGCCAGA (SEQ ID NO: 444); 
GAGCCAGATA (SEQ ID N0:446); 
GCCAGATAAG (SEQ ID N0:448) ; 
CAGATAAGGT (SEQ ID NO:450) ; 
GATAAGGTAG (SEQ ID NO: 452) ; 
TAAGGTAGAA (SEQ ID NO: 454) ; 
AGGTAGAAGA (SEQ ID NO: 456) ; 
GTAGAAGAGG (SEQ ID NO: 458) ; 
AGAAGAGGCC (SEQ ID N0:460) ; 
AAGAGGCCAA (SEQ ID NO:462) ; 
GAGGCCAATA (SEQ ID NO: 464) ; 
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GGCCAATAAA (SEQ ZO NO: 4 66) ; 
CCAATAAAGG (SEQ ID NO: 4 68) ; 
AATAAAGQAG (SEQ ID NO: 4 70) ; 
TAAAG6AGA8 (SEQ ID NO: 4 72) ; 
AAGGAGAGAA (SEQ ID NO: 4 74) ; 
GGAGAGAACA (SEQ ID N0:476) ; 
AGAGAACACC (SEQ ID NO: 4 78) ; 
AGAACACCAG (SEQ ID NO: 430) ; 
AACACCAGCT (SEQ ID NO: 482) ; 
CACCAGCTTG (SEQ ID NO: 4 84) ; 
CCAGCTTGTT (SEQ ID NO:486) ; 
AGCTTGTTAC (SEQ ID NO: 4 88) ; 
CTTGTTACAC (SEQ ID NO: 4 90); 
TGTTACACCC (SEQ ID NO: 492) ; 
TTACACCCTG (SEQ ID NO: 494) ; 
ACACCCTGTG (SEQ ID NO: 495) ; 
ACCCTGTGAG (SEQ ID NO: 4 98) ; 
CCTGTGAGCC (SEQ ID NO: 500) ; 
TGTGAGCCTG (SEQ ID NO: 502); 
TGAGCCTGCA (SEQ ID NO: 504) ; 
AGCCTGCATG (SEQ ID NO: 506) ; 
CCTGCATGGA (SEQ ID NO: 508) ; 
TGCATGGAAT (SEQ ID NO: 510); 
CATGGAATGG (SEQ ID NO: 512} t 
TGGAATGGAT (SEQ ID NO: 514) ; 
GAATGGATGA (SEQ ID NO: 516) ; 
ATGGATGACC (SEQ ID NO: 518); 
GGATGACCCT (SEQ ID NO:520); 
ATGACCCTGA (SEQ ID NO: 522); 
GACCCTGAGA (SEQ ID NO: 524) ; 
CCCTGAGAGA (SEQ ID NO: 526) ; 
CTGAGAGAGA (SEQ ID NO: 528) ; 
GAGAGAGAAG (SEQ ID NO: 530) ; 
GAGAGAAOTG (SEQ ID NO: 532) ; 
GAGAAGTGTT (SEQ ID NO: 534); 
GAAGTGTTAG (SEQ ID NO: 536) ; 
AGTGTTAGAG (SEQ ID NO: 53 8) ; 
TGTTAGAGTG (SEQ ID NO: 540) ; 
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ACAGCCGCCT 


(SEQ 


ID 


NO: 558} ; 


AGCCGCCTAG 


(SEQ 


ID 


NO: 560) ; 


CCGCCTAGCA 


(SEQ 


ID 


NO: 562) ; 


GCCTAGCATT 


(SEQ 


ID 


NO: 564) ; 


CTAGCATTTC 


(SEQ 


ID 


NO: 566) ; 


AGCATTTCAT 


(SEQ 


ID 


NO:568) ; 


CATTTCATCA 


(SEQ 


ID 


NO: 570) ; 


TTTCATCACQ 


(SEQ 


ID 


NO: 572) ; 


TCATCACGTG 


(SEQ 


ID 


NO: 574) ; 


ATCACGTGGC 


(SEQ 


ID 


NO: 576) ; 


CACGTGGCCC 


(SEQ 


ID 


NO: 578) ; 


CGTGGCCCGA 


(SEQ 


ID 


NO:580) ; 


TGGCCCGAGA 


(SEQ 


ID 


NO: 582) ; 


GCCCGAGAGC 


(SEQ 


ID 


NO: 584) ; 


CCGAGAGCTG 


(SEQ 


ID 


NO: 586) ; 


GAGAGCTGCA 


(SEQ 


ID 


NO: 58 8) ; 


GAGCTGGATC 


(SEQ 


ID 


NO: 590) ; 


GCTGCATCCG 


(SEQ 


ID 


NO: 592) ; 


TGCATCCGGA 


(SEQ 


ID 


NO: 594) ; 


CATCCGGAGT 


{SEQ 


ID 


NO: 596) ; 


TCOGGAGTAC 


{SEQ 


ID 


NO: 598) ; 


CGGAGTACTT 


(SEQ 


ID 


NO: 600) ; 


GAGTACTTCA 


(SEQ 


ID 


NO: 602) ; 


GTACTTCAAG 


(SEQ 


ID 


NO: 604) ; 


ACTTCAAGAA 


(SEQ 


ID 


NO:606) ; 


TTCAAGAACT 


(SEQ 


ID 


NO:608) ; 


CAAGAACTGC 


(SEQ 


ID 


NO: 610) ; 


AGAACTGCTG 


(SEQ 


ID 


NO: 612) ; 



TAGA6TGGAG 


(SEQ 


ID 


NO: 543) ; 


GAGTGGAGGT 


(SEQ 


ID 


NO: 545) ; 


GTGGAGGTTT 


(SEQ 


ID 


NO: 547) ; 


GGAGGTTTGA 


(SEQ 


ID 


NO: 549) ; 


AGGTTTGACA 


(SEQ 


ID 


NO: 551) ; 


OTTTGACAGC 


(SEQ 


ID 


NO: 553) ; 


TTGACAGCCG 


(SEQ 


ID 


NO: 555) ; 


GACAGCCGCC 


(SEQ 


ID 


NO:557) ; 


CAGCCGCCTA 


(SEQ 


ID 


NO: 559) ; 


GCCGCCTAGC 


(SEQ 


ID 


NO: 561) ; 


CGCCTAGCAT 


(SEQ 


ID 


NO: 563) ; 


CCTAGCATTT 


(SEQ 


ID 


NO: 565) ; 


TAGCATTTCA 


(SEQ 


ID 


NO: 567) ; 


GCATTTCATC 


(SEQ 


ID 


NO: 569) ; 


ATTTCATCAC 


(SEQ 


ID 


NO: 571) ; 


TTCATCACGT 


(SEQ 


ID 


NO: 573) ; 


CATCACGTGG 


(SEQ 


ID 


NO: 575) ; 


TCACGTGGCC 


(SEQ 


ID 


NO: 577) ; 


ACGTGGCCCG 


(SEQ 


ID 


NO: 579) ; 


GTGGCCCGAG 


(SEQ 


ID 


NO:581) ; 


GGCCCGAGAG 


(SEQ 


ID 


NO:583) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO:585) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO: 587) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO: 589) ; 


AGCTGCATCC 


(SEQ 


ID 


NO: 591) ; 


CTGCATCCGG 


(SEQ 


ID 


NO: 593) ; 


GCATCCGGAG 


(SEQ 


ID 


NO: 595) ; 


ATCCGGAGTA 


(SEQ 


ID 


NO: 597) ; 


CCGGAGTACT 


(SEQ 


ID 


NO: 599) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 601) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 603) ; 


TACTTCAAGA 


(SEQ 


ID 


NO: 605) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 607) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 609} ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 611} ; 


GAACTGCTGA 


(SEQ 


ID 


NO: 613) . 
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33. A method according to claim 27 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HIV-1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID 


NO: 652) 


. CTTTTTGCCT 




ID 


aju : oo o j ; 


TTTTTGCCTG 


(SEQ 


ID 


NO: 654} 


. TTTTGCCTGT 


V5EQ 


Tn 
1U 


NO:o55J ; 


TTTGCCTGTA 


(SEQ 


ID 


NO: 656) 


• TTGCCTGTAC 


(SEQ 


ID 


NO: 657) ; 


TGCCTGTACT 


(SEQ 


ID 


NO: 658) 


; GCCTGTACTG 


(SEQ 


ID 


NO:659) ; 


CCTGTACTGG 


(SEQ 


ID 


NO: 660) 


■ CTGTACTGGG 


(SEQ 


ID 


NO:661) ; 


TGTACTGGGT 


(SEQ 


ID 


NO: 662) 


r GTACTGGGTC 


(SEQ 


ID 


NO: 663) ; 


TACTGGGTCT 


(SEQ 


ID 


NO: 664) , 


f ACTGGGTCTC 


(SEQ 


ID 


NO: 665 J ; 


CTGG6TCTCT 


(SEQ 


ID 


NO: 666) ; TGGGTCTCTC 


(SEQ 


ID 


NO: 667) ; 


GGGTCTCTCT 


(SEQ 


ID 


NO:668) ; GGTCTCTCTG 


(SEQ 


ID 


NO: 669) ; 


GTCTCTCTGG 


(SEQ 


ID 


NO: 670) ; TCTCTCTGGT 


(SEQ 


ID 


NO: 671) ; 


CTCTCTGGTT 


(SEQ 


ID 


NO: 672) ; TCTCTGGTTA 


(SEQ 


ID 


NO: 673) ; 


CTCTGGTTAG 


(SEQ 


ID 


NO: 674) ; TCTCTGGTTA 


(SEQ 


ID 


NO: 675) ; 


CTGGTTAGAC 


(SEQ 


ID 


NO: 676) ; TGGTTAGACC 


(SEQ 


ID 


NO: 677) ; 


GGTTAGACCA 


(SEQ 


ID 


NO: 678} ; GTTAGACCAG 


(SEQ 


ID 


NO: 679) ; 


TTAGACCAGA 


(SEQ 


ID 


NO : 68 0 ) ; TAGACCAGAT 


(SEQ 


ID 


NO: 681) ; 


AGACCAGATC 


(SEQ 


ID 


NO: 682) ; GACCAGATCT 


(SEQ 


ID 


NO: 683) ; 


ACCAGATCTG 


(SEQ 


ID 


NO: 684) ; CCAGATCTGA 


(SEQ 


ID 


NO: 685) ; 


CAGATCTGAG 


(SEQ 


ID 


NO : 686 ) ; AGATCTGAGC 


(SEQ 


ID 


NO: 687) ; 


GATCTGAGCC 


(SEQ 


ID 


NO: 688) ; ATCTGAGCCT 


(SEQ 


ID 


NO: 689) ; 


TCTGAGCCTG 


(SEQ 


ID 


NO: 690) , 


CTGAGCCTGG 


(SEQ 


ID 


NO: 691) ; 


TGAGCCTGGG 


(SEQ 


ID 


NO: 692) ; 


GAGCCTGGGA 


(SEQ 


ID 


NO: 693) ; 


AGCCTGGGAG 


(SEQ 


ID 


NO: 694) ; 


- GCCTGGGAGC 


(SEQ 


ID 


NO: 695); 


CCTGGGAGCT 


(SEQ 


ID 


NO: 696) ; 


CTGGGAGCTC 


(SEQ 


ID 


NO: 697) ; 


TGGGAGCTCT 


(SEQ 


ID 


NO:698) j 


GGGAGCTCTC 


(SEQ 


ID 


NO: 699) ; 


GQAGCTCTCT 


(SEQ 


ID 


NO: 700) j 


QAGCTCTCTG 


(SEQ 


ID 


NO: 701) j 


AGCTCTCTGG 


(SEQ 


ID 


NO: 702) ; 


GCTCTCTGGC 


(SEQ 


ID 


NO: 703) ; 


CTCTCTGGCT 


(SEQ 


ID 


NO: 704) , 


■ TCTCTGGCTA 


(SEQ 


ID 


NO: 705) ; 


CTCTGGCTAA 


(SEQ 


ID 


NO: 706) , 


• TCTGGCTAAC 


(SEQ 


ID 


NO: 707) ; 


CTGGCTAACT 


(SEQ 


ID 


NO: 708) , 


; TGGCTAACTA 


(SEQ 


ID 


NO:709) ; 


GGCTAACTAG 


(SEQ 


ID 


NO: 710) , 


■ GCTAACTAGG 


(SEQ 


ID 


NO: 711) ; 


CTAACTAGGG 


(SEQ 


ID 


NO: 712) , 


■ TAACTAGGGA 


(SEQ 


ID 


NO: 713); 


AACTAGGGAA 


(SEQ 


ID 


NO: 7 14) , 


■ ACTAGGGAAC 


(SEQ 


ID 


NO: 715) ; 


CTAGGGAACC 


(SEQ 


ID 


NO: 716) , 


: TAGGGAACCC 


(SEQ 


ID 


N0:717) ; 


AGGGAACCCA 


(SEQ 


ID 


NO: 718) 


: GGGAACCCAC 


(SEQ 


ID 


N0:719) ; 
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GGAACCCACT 

AACCCACTGC 

CCCACTGCTT 

CACTGCTTAA 

CTGCTTAAGC 

GCTTAAGCCT 

TTAAGCCTCA 

AAGCCTCAAT 

GCCTCAATAA 

CTCAATAAAG 

CAATAAAGCT 

ATAAAGCTTG 

AAAGCTTGCC 

AGCTTGCCTT 

CTTGCCTTGA 

TGCCTTGAGT 

CCTTGAGTGC 

TTGAGTGCTT 

QAGTGCTTCA 

GTGCTTCAAG 

GCTTCAAGTA 

TTCAAGTAGT 

CAAGTAGTGT 

AGTAGTGT6T 

TAGTGTGTGC 

GTGTGTGCCC 

6TGTGCCC6T 

GTGCCCGTCT 

GCCCGTCTGT 

CC6TCT6TT6 

GTCTQTTGTO 

CTGTTGTQT6 

GTTGTGTGAC 

TGTGTGACTC 

TGTGTGACTC 

TGTGACTCTG 

TGACTCTGGT 

ACTCTGGTAA 



(SEQ ID NO: 720) ; 
(SEQ ID NO: 722) ; 
(SEQ ID NO: 724); 
(SEQ ID NO: 726); 

(SEQ ID NO: 728) ; 

(SEQ ID NO:730) ; 

(SEQ ID NO:732) ; 

(SEQ ID NO: 734) ; 

(SEQ ID NO: 736) ; 

(SEQ ID NO: 738) ; 

(SEQ ID NO: 740) ; 

(SEQ ID NO: 742) ; 

(SEQ ID NO: 744) ; 

(SEQ ID NO: 746); 

(SEQ ID NO: 748) ; 

(SEQ ID NO: 750) ; 

(SEQ ID NO: 752) ; 

(SEQ ID NO: 754) ; 

(SEQ ID NO: 756) ; 

(SEQ ID NO: 758) ; 

(SEQ ID NO: 760) ; 

(SEQ ID NO: 762) ; 

(SEQ ID NO: 764) ; 

(SEQ ID NO: 766) ; 

(SEQ ID NO: 768) ; 

(SEQ ID NO: 770); 
(SEQ ID NO: 772) ; 
(SEQ ID NO: 774) ; 
(SEQ ID NO: 776) ; 
(SEQ ID NO: 778) ; 
(SEQ ID NO: 780) ; 
(SEQ ID NO: 782) ; 
(SEQ ID NO: 784) ; 
(SEQ ID NO: 786) ; 
(SEQ ID NO: 78 8) ; 
(SEQ ID NO: 790) ; 
(SEQ ID NO: 792) ; 
(SEQ ID NO: 794) ; 



GAACCCACTG 


(SEQ 


ID NO: 721) 


ACCCACTGCT 


(SEQ 


ID NO: 723) 


CCACTGCTTA 


(SEQ 


ID NO: 725) 


ACTGCTTAAG 


(SEQ 


ID NO: 727) 


TGCTTAAGCC 


(SEQ 


ID NO: 729) 


CTTAAGCCTC 


(SEQ 


ID NO: 731) 


TAA6CCTCAA 


(SEQ 


ID NO:733) 


A6CCTCAATA 


(SEQ 


ID NO: 735) 


CCTCAATAAA 


(SEQ 


ID NO: 737) 


TCAATAAAOC 


(SEQ 


ID NO: 739) 


AATAAAGCTT 


(SEQ 


ID NO: 741) 


TAAAGCTT6C 


(SEQ 


ID NO:743) 


AAGCTTGCCT 


(SEQ 


ID NO: 745) 


GCTTGCCTTG 


(SEQ 


ID NO: 747) 


TTGCCTTGAG 


(SEQ 


ID NO: 749) 


GCCTTGAGTG 


(SEQ 


ID NO: 751) 


CTTGAGTGCT 


(SEQ 


ID NO:753) , 


TGAGTGCTTC 


(SEQ 


ID NO: 755) , 


AGTGCTTCAA 


(SEQ 


ID Np : 757) , 


TGCTTCAAGT 


(SEQ 


ID NO: 759) , 


CTTCAAGTAG 


(SEQ 


ID NO: 761) ; 


TCAAGTAGTG 


(SEQ 


ID NO: 763) j 


AAGTAGTGTG 


(SEQ 


ID NO: 765) ; 


GTAGTGTGTG 


(SEQ 


ID NO: 767) j 


AGTGTGTGCC 


(SEQ 


ID NO: 769) ; 


TGTGTGCCCG 


(SEQ 


ID NO: 771) ; 


TGTGCCGGTC 


(SEQ 


ID NO: 773) ; 


TGCCCGTCTG 


(SEQ 


ID NO: 775) ; 


CCCGTCTGTT 


(SEQ 


ID NO: 777) ; 


CGTCTGTTGT 


(SEQ 


ID NO: 779) ; 


TCTGTTGTGT 


(SEQ 


ID NO: 781) ; 


TGTTGTGTGA 


(SEQ 


ID NO: 783) ; 


TTGTGTGACT 


(SEQ 


ID NO: 785) ; 


GTGTGACTCT 


(SEQ 


ID NO: 787) ; 


GTGTGACTCT 


(SEQ 


ID NO: 789) ; 


GTGACTCTGG 


(SEQ 


ID NO: 791) ; 


GACTCTGGTA 


(SEQ 


ID NO: 793) ; 


CTCTGGTAAC 


(SEQ 


ID NO: 795) ; 
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TCTGGTAACT (SEQ ID NO: 796); CTGGTAACTA (SEQ ID NO:797); 

TGGTAACTAG (SEQ ID NO:798); GGTAACTAGA (SEQ ID NO:799) . 

34. A method according to claim 24 wherein said HIV-1 isolate has the identifying 
characteristics of HIV-1 isolate ECACC V94101706 or HIV-1 isolate ECACC 
V941031 169 or isolate C54 deposited at ECACC on 14 February, 1995 under Accession 
Number . 

35. A method according to claim 20 wherein the HIV-1 isolate is reactive to 
antibodies to a glycoprotein of HIV-1, is capable of inducing an immune response of 
gag, pol and/or em and which is incapable of directing synthesis of a nef gene product 
or a full length nef gene product 

36. A method according to claim 35 wherein the glycoprotein is at least one of gp41- 
45, gpl20 and/or gp!60. 

37. A method for vaccinating an individual against the development of AIDS or 
AIDS related diseases, said method comprising administering to said individual a non- 
pathogenic isolate of HIV-1 in an amount effective to infect target cells and to generate 
target cells carrying DNA derived from said non-pathogenic HIV-1, 

38. A method according to claim 37 wherein said isolate is capable of stimulating 
an immune response in a human or primate against at least one glycoprotein on HTV-1 
whilst not substantially reducing proliferative responses and cytokine production to a 
mitogen in said human or primate compared to a healthy, non-infected human or primate 
subject 

39. A method according to claim 38 wherein said isolate carries one or more 
mutations in its genome resulting in the inability to direct synthesis of at least one 
polypeptide or protein from a pathogenic strain of HTV-1, 



WO 95/21912 



-212- 



PCT/ATJ95/00063 



40. A method according to claim 17 wherein said isolate earned one or more 
mutations, in its genome resulting in said genome directing synthesis of a truncated form 
of a polypeptide or protein from a pathogenic strain of HTV-1. 

41. A method according to claim 39 or 40 wherein said HTV-1 isolate carries a 
mutation in its nef gene and/or long terminal repeat (LTR) region or in a functionally 
equivalent location in the HTV-1 genome. 

42. A method according to claim 40 wherein said HTV-1 isolate is reactive to 
antibodies to gp41-45> gpl20 and/or gpl60 of HTV-1 and carries a deletion of at least 
ten contiguous nucleotides in a region corresponding to the nef gene and/or LTR region 
of pathogenic HIV-1. 

43. A method according to claim 42 wherein said HIV-1 isolate is capable of 
inducing an immune response to at least one of gag, pol and/or env. 

44. A method according to claim 43 wherein said HTV-1 isolate carries a deletion in 
its genome of at least 10 nucleotides within the region from nucleotide 87S7 to 
nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

45. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (I) 8830-8862; 

(ii) 9009-9035; 
(in) 9019-9029; and 

(iv) 9033-9049. 

46. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (v) 9281-9371; 

(vi) 9281-9362; 
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(vii) 9105-9224; and 

(viii) 9271-9370. 

47. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (ft) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 
(rii) 9112-9204. 

47. A method according to claim 44 wherein said isolate carries a deletion in its 
genome of at least 10 nucleotides within a region selected from the list consisting of: 
nucleotide (xiil) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



48. A method according to claim 44 wherein said deletion encompasses at least one 
of the following decanucleotides from the nef gene of HIV- 1 strain NL43: 



ATGGGTGGCA 


(SEQ 


ID 


NO:2) ; 


TGGGTGGCAA 


(SEQ ID NO: 3) ; 


GGGTGGCAAG 


(SEQ 


ID 


NO:4) ; 


GGTGGGAAGT 


(SEQ ID NO: 5) ; 


GTQGCAAGTG 


(SEQ 


ID 


NO: 6) ; 


TGGCAAGTGG 


(SEQ ID NO: 7) ; 


GGCAAQTGGT 


(SEQ 


ID 


NO: 8) ; 


GCAAGTGGTC 


(SEQ ID NO: 9) ; 


CAAGTGGTCA 


(SEQ 


ID 


NO: 10) ; AAGTGGTCAA 


(SEQ 


ID NO.-ll); 


AGTGGTCAAA 


(SEQ 


ID 


NO: 12); GTGGTCAAAA 


(SEQ 


ID NO:13); 


TGGTCAAAAA 


(SEQ 


ID 


NO: 14) ; 


GGTCAAAAAG 


(SEQ 


ID NO:15); 


GTCAAAAAGT 


(SEQ 


ID 


NO: 16); TCAAAAAGTA 


(SEQ 


ID NO: 17); 


CAAAAAGTAG 


(SEQ 


ID 


NO: 18) j 


AAAAAGTAGT 


(SEQ ID NO: 19) ; 


AAAAGTAGTG 


(SEQ 


ID 


NO:20) i 


AAAGTAGTGT 


(SEQ 


ID NO: 21) ; 


AAGTAGTGTG 


(SEQ 


ID 


NO: 22) , 


AGTAGTGTGA 


(SEQ 


ID NO:23) ; 


GTAGTGTGAT 


(SEQ 


ID 


NO: 24) , 


r TAGTGTGATT 


(SEQ ID NO: 25) ; 


AGTGTGATTG 


(SEQ 


ID 


NO:26) , 


. GTGTGATTGG 


(SEQ 


ID NO:27); 


TGTGATTGGA 


(SEQ 


ID 


NO:28) , 


. GTGATTGGAT 


(SEQ 


ID NO:29); 


TGATTGGATG 


(SEQ 


ID 


NO:30) , 


. GATTGGATGG 


(SEQ 


ID NO:31); 


ATTGGATGGC 


(SEQ 


ID 


NO: 32) , 


. TTGGATGGCC 


(SEQ 


ID NO:33); 


TGGATGGCCT 


(SEQ 


ID 


NO: 34) , 


? GGATGGCCTG 


(SEQ 


ID NO: 35); 
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GATGGCCTGC (SEQ ID NO: 36) ; 
TGGCCTGCTG (SEQ ID NO:38) ; 
GCCTGCTGTA (SEQ ID NO:40) ; 
CTGCTGTAAG (SEQ ID NO:42) ; 
GCTGTAAGGG (SEQ ID NO: 44); 
TGTAAGGGAA (SEQ ID NO:46); 
TAAGGGAAAG (SEQ ID NO:48) ; 
AGGGAAAGAA (SEQ ID NO:50); 
GGAAAGAATG (SEQ ID NO: 52) ; 
AAAGAATGAG (SEQ ID NO: 54) ; 
AGAATGAGAC (SEQ ID NO:56); 
AATGAGACGA (SEQ ID NO: 58); 
TGAGACGAGC (SEQ ID NO: 60); 
AGACGAGCTG (SEQ ID NO: 62); 
AOGAGCTGAG (SEQ ID NO: 64) ; 
GAGCTGAGCC (SEQ ID NO:66); 
GCTGAGCCAG (SEQ ID NO:68) ; 
TGAGCCAGCA (SEQ ID NO: 70); 
AGCCAGCAGC (SEQ ID NO: 72); 
CCAGCAGCAG (SEQ ID NO: 74); 
AGCAGCAGAT (SEQ ID NO: 76); 
CAGCAGATGG (SEQ ID NO: 78) ; 
GCAGATGGGG (SEQ ID NO: 80); 
AGATGGGGTG (SEQ ID NO: 82); 
ATGGGGTGGG (SEQ ID NO: 84) ; 
GGGGTGGGAG (SEQ ID NO:86); 
GGTGGGAGCA (SEQ ID NO:88); 
TGGGAGCAGT (SEQ ID NO: 90); 
GGAGCAGTAT (SEQ ID NO:92); 
AGCAGTATCT (SEQ ID NO: 94) ; 
CAGTATCTCG (SEQ ID NO:96); 
GTATCTCGAG (SEQ ID NO: 98) ; 
ATCTCGAGAC (SEQ ID NO: 100) ; 
CTCGAGACCT (SEQ ID NO:102) ; 
CGAGACCTAG (SEQ ID NO:104) ; 
AGACCTAGAA (SEQ ID NO:106) ; 
ACCTAGAAAA (SEQ ID NO: 108) ; 
CTAGAAAAAC (SEQ ID NO:110); 



ATGGCCTGCT 


(SEQ 


ID 


NO: 37) ; 


GGCCTGCTGT 


(SEQ 


ID 


NO:39) ; 


CCTGCTGTAA- 


(SEQ 


ID 


NO:41) ; 


TGCTGTAAGG 


(SEQ 


ID 


NO:43) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO:45) ; 


GTAAGGGAAA 


(SEQ 


ID 


NO:47) ; 


AAGGGAAAGA 


(SEQ 


ID 


NO: 49) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO:51) ; 


GAAAGAATGA 


(SEQ 


ID 


NO: 53) ; 


AAGAATGAGA 


(SEQ 


ID 


NO* 55) * 


GAATGAGACG 


(SEQ 


ID 




ATGAGACGAG 


(SEO 


ID 


wo ■ 5 q ) • 


GAGACGAGCT 


(SEO 


ID 


WO • G 1 ) • 


GACGAGCTGA 


(SEQ 


ID 


NO* £3) • 


CGAGCTGAGC 


(SEO 


ID 


NO -65) • 


AGCTGAGCCA 


(SEO 


ID 


NO* 67) -> 


CTGAGCCAGC 


(SEO 


ID 


NO- 69) • 


GAGCCAGCAG 


(SEO 


ID 


NO- 71 ) • 


GCCAGCAGCA 


(SEO 


ID 


WO - 71 ) • 


CAGCAGCAGA 


(SEO 


ID 


wn • 7R\ . 


GCAGCAGATG 


(SEO 


ID 


NO ■ 77) • 


AGCAGATGGG 

• m^m ^^mm^mmm A WWW 


(SEO 


TD 

JL JL/ 


NO ■ 7Q S • 


CAGATGGGGT 


(SEO 


ID 


no • ai v » 


GATGGGGTGG 


(SEO 


ID 


WO* M ) * 


TGGGGTGGGA 


(SEO 


ID 


NO- flR) • 


GGGTGGGAGC 

-«* v*j«*v a V WW>*AWJ 


(SEO 


ID 


WO* ft7) • 


GTGGGAGCAG 


(SEQ 


ID 


NO: 89) ; 


GGGAGGAGTA 


(SEQ 


ID 


NO: 91) ; 


GAGCAGTATC 


(SEQ 


ID 


NO:93) ; 


GCAGTATCTC 


(SEQ 


ID 


NO: 95) ; 


AGTATCTCGA 


(SEQ, 


ID 


NO: 97) ; 


TATCTCGAGA 


(SEQ 


ID 


NO: 99) ; 



TCTCGAGACC (SEQ ID NO:101) ; 
TCSQAGACCTA (SEQ ID NO: 103); 
GAGACCTAGA (SEQ ID NO:105) ; 
GACCTAGAAA (SEQ ID NO:107) ; 
CCTAGAAAAA (SEQ ID NO:109) ; 
TAGAAAAACA (SEQ ID NO:lll) ; 
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AGAAAAACAT 


(SEQ 


ID 


NO: 112) 


GAAAAACATG 


(SEQ 


ID 


N0:113) 


AAAAACATGG 


(SEQ 


ID 


NO: 114) 


; AAAACATGGA 


(SEQ 


ID 


NO: 115) 


AAACATGGAG 


(SEQ 


ID 


NO: 116) 


AACATGGAGC 


(SEQ 


ID 


N0:117) 


ACATGGAGCA 


(SEQ 


ID 


NO: 118) 


CATGGAGCAA 


(SEQ 


ID 


NO:119) 


ATGGAGCAAT 


(SEQ 


ID 


NO: 120) 


{ TGGAGCAATC 


(SEQ 


ID 


NO:121) 


GGAGCAATCA 


(SEQ 


ID 


NO: 122) 


; GAGCAATCAC 


(SEQ 


ID 


NO:123) 


AGCAATCACA 


(SEQ 


ID 


NO: 124) 


} GCAATCACAA 


(SEQ 


ID 


NO:125) 


CAATCACAAG 


(SEQ 


ID 


NO: 126) 


; AATCACAAGT 


(SEQ 


ID 


N0:127) 


ATCACAAGTA 


(SEQ 


ID 


NO: 128) , 


; TCACAAGTAG 


(SEQ 


ID 


NO:129) 


CACAAGTAGC 


(SEQ 


ID 


NO:130) , 


} ACAAGTAGCA 


(SEQ 


ID 


N0:131) 


CAAGTAGCAA 


(SEQ 


ID 


NO: 132) , 


; AAGTAGCAAT 


(SEQ 


ID 


NO:133) 


AGTAGCAATA 


(SEQ 


ID 


NO: 134) , 


. GTAGCAATAC 


(SEQ 


ID 


N0:135) 


TAGCAATACA 


(SEO 


ID 


NO: 136) , 


; AGCAATACAG 


(SEQ 


ID 


N0:137) 


GCAATACAGC 


(SEO 


ID 


NO: 138) , 


( CAATACAGCA 


(SEO 

> w**W 


ID 


NO:139) 


AATACAGCAG 


(SEO 

\ w*-*W 


ID 


NO: 140); ATACAGCAGC 


(SEQ 

* www 


ID 


NO: 141) 


TACAGCAGCT 


(SEQ 


ID 


NO: 142) ; ACAGCAGCTA 


(SEQ 


ID 


NO:143) 


CAGCAGCTAA 


(SEQ 


ID 


NO: 144); AGCAGCTAAC 


(SEQ 


ID 


NO: 145) 


GCAGCTAACA 


(SEQ 


ID 


NO: 146) ; CAGCTAACAA 


(SEQ 


ID 


NO: 147) 


AGCTAACAAT 


(SEQ 


ID 


NO: 148) ; 


GCTAACAATG 


(SEQ 


ID 


N0:149) 


CTAACAATGC 


(SEQ 

\ www 


ID 


NO: 150) ; 


• TAACAATGCT 


(SEQ 


ID 


N0:151) 


AACAATGCTG 


(SEO 

\ W*JTW 


ID 


NO: 152) ; 


ACAATGCTGC 


(SEQ 


ID 


N0:153) 


CAATGCTGCT 

W***»X WW * %*W X 


(SEO 

\ way 


ID 
*** 


NO: 154) j 


AATGCTGCTT 


(SEQ 


ID 


NO: 155) 


ATGCTGCTTTG 

«%XwWX>7W X X V7 


(SEO 

I W*->)f 


ID 


NO: 156) i 


TGCTGCTTGT 


(SEQ 

\ w»w 


ID 


NO:157) 


ww X W X X V X w 


(SEO 

\wCpU 


ID 

XX/ 


NO: 158) j 


CTGCTTGTGC 


(SEQ 


ID 


NO:159) 


X ww X X w X www 


(SEO 


ID 

XX/ 


NO:160) ; 


GCTTGTGCCT 


(SEQ 

\ w*aw 


ID 


NO: 161) 


wX X w X www X w 


(SEO 


ID 

XX/ 


NO: 162) j 


TTGTGCCTGG 


(SEO 

I WWW 


ID 


NO:163) 


X wx www 4. www 




ID 


NO: 164) ; 


GTGCCTGGCT 


(SEO 


ID 


N0:165) 


X www A www* A 


fog A 


TD 

XX/ 


NO: 166) j GCCTGGCTAG 


(SEO 


ID 


ND:167) 


ww x www x Awn 


(SEO 


ID 

XX/ 


NO: 168); CTGGCTAGAA 


(SEO 

\ »X"W 


ID 


NO:l€9) 


TGGCTAGAAG 


(SEO 


ID 


NO : 170 ) ; GGCTAGAAGC 


(SEQ 


ID 


N0:171) 


GCTAGAAGCA 


(SEQ 


ID 


NO : 172 ) ; CTAGAAGCAC 


(SEQ 


ID 


NO: 173) 


inucnnuWlwi 




XX/ 


NO: 174) i 


AGAAGCACAA 


(SEO 


ID 


NO:175) 


GAAGCACAAG 


(SEQ 


ID 


NO: 176) ; 


AAGCACAAGA 


(SEQ 


ID 


N0:177) 


AGCACAAGAG 


(SEQ 


ID 


NO: 178) j 


GCACAAGAGG 


(SEQ 


ID 


NO:179) 


CACAAGAGGA 


(SEQ 


ID 


NO: 180) j 


ACAAGAGGAG 


(SEQ 


ID 


NO: 181) 


CAAGAGGAGG 


(SEQ 


ID 


NO: 182) i 


AAGAGGAGGA 


(SEQ 


ID 


N0:183) 


AGAGGAGGAA 


(SEQ 


ID 


NO: 184) i 


GAGGAGGAAG 


(SEQ 


ID 


NO:185) 


AGGAGGAAGA 


(SEQ 


ID 


NO: 186) , 


GGAGGAAGAG 


(SEQ 


ID 


N0:187) 
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GAGGAAGAGG (SEQ ID NO:188); 
GGAAGAGGTG (SEQ ID NO: 190); 
AAGAGGTGGG (SEQ ID NO:192); 
GAGGTGGGTT (SEQ ID N0:194) ; 
GGTGGGTTTT (SEQ ID NO:196) ; 

TGGGTTTTCC (SEQ ID NO: 198) ; 

GGTTTTCCAG (SEQ ID NO:200) ; 

TTTTCCAGTC (SEQ ID NO: 202) ; 

TTCCAGTCAC (SEQ ID NO: 204) ; 

CCAGTCACAC (SEQ ID NO: 206) ; 

AGTCACACCT (SEQ ID NO: 208); 

TCACACCTCA (SEQ ID NO:210); 

ACACCTCAGG (SEQ ID NO: 212) ; 

ACCTCAGGTA (SEQ ID NO: 214); 

CTCAGGTACC (SEQ ID NO: 216) ; 

CAGGTACCTT (SEQ ID NO: 218) ; 

GGTACCTTTA (SEQ ID NO:220) ; 

TACCTTTAAG (SEQ ID NO: 222) ; 

CCTTTAAGAC (SEQ ID NO: 224) ; 

TTTAAGACCA (SEQ ID NO-226) ; 

TAAGACCAAT (SEQ ID NO: 228); 
AGACCAATGA (SEQ ID NO:230) ; 
ACCAATGACT (SEQ ID NO:232) ; 
CAATGACTTA (SEQ ID NO:234); 
ATGACTTACA (SEQ ID NO: 236) ; 
GACTTACAAG (SEQ ID NO:238) ; 
CTTACAAGGC (SEQ ID NO:240) ; 
TACAAGGCAG (SEQ ID NO:242) ; 
CAAGGCAGCT (SEQ ID NO: 244) ; 
AGGCAGCTGT (SEQ ID NO:246) ; 
GCAGCTGTAG (SEQ ID NO:248) ; 
AGCTGTAGAT (SEQ ID NO:250) ; 
CTGTAGATCT (SEQ ID NO:252) ; 
GTAGATCTTA (SEQ ID NO: 254) ; 
AGATCTTAGC (SEQ ID NO:256); 
ATCTTAGCCA (SEQ ID NO:258); 
CTTAGCCACT (SEQ ID NO:260); 
TAGCCACTTT (SEQ ID NO: 262); 
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AGGAAGAGGT (SEQ ID NO: 189) 
GAAGAGGTGG (SEQ ID NO: 191) 
AGAGGTGGGT (SEQ ID NO: 193) 
AGGTGGGTTT (SEQ ID NO: 195) 
GTGGGTTTTC (SEQ ID NO: 197) 
GGGTTTTCCA (SEQ ID NO: 199). 
GTTTTCCAGT (SEQ ID NO: 2 01) 
TTTCCAGTCA (SEQ ID NO: 203) 
TCCAGTCACA (SEQ ID NO:205) 
CAGTCACACC (SEQ ID NO: 207) 
GTCACACCTC (SEQ ID NO: 209) 
CACACCTCAG (SEQ ID N0:211) 
CACCTCAGGT (SEQ. ID NO: 213), 
CCTCAGGTAC (SEQ ID N0:215)j 
TCAGGTACCT (SEQ ID NO:217) j 
AGGTACCTTT (SEQ ID NO: 219) j 
GTACCTTTAA (SEQ ID NO: 221) j 

ACCTTTAAGA (SEQ ID NO: 223) ; 
CTTTAAGACC (SEQ ID NO:225) ; 

TTAAGACCAA (SEQ ID NO: 227) ; 

AAGACCAATG (SEQ ID NO: 229) ; 

GACCAATGAC (SEQ ID NO:231) ; 

CCAATGACTT (SEQ ID NO: 233) ; 

AATGACTTAC (SEQ ID NO: 23 5) ; 

TGACTTACAA (SEQ ID NO:237) ; 

ACTTACAAGG (SEQ ID NO:239); 

TTACAAGGCA (SEQ ID NO: 241) ; 

ACAAGGCAGC (SEQ ID NO: 243) ; 

AAG6CAGCTO (SEQ ID NO:245); 

GGCAGCTGTA (SEQ ID NO: 247) ; 

CAGCTGTAGA (SEQ ID NO: 249) ; 

GCTGTAGATC (SEQ ID NO: 251) ; 

TGTAGATCTT (SEQ ID NO:253) ; 

TAGATCTXAG (SEQ ID NO:255) ; 

GATCTTAGCC (SEQ ID NO:257) ; 

TCTTAGCCAC (SEQ ID NO:259) ; 

TTA6CCACTT (SEQ ID NO:261) ; 
A6CCACTTTT (SEQ ID NO:263) ; 
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GCCACTTTTT 


(SEQ 


ID 


NO:264) ; 


CACTTTTTAA 


{SEQ 


ID 


NO:266) ; 


CTTTTTAAAA 


(SEQ 


ID 


NO: 268) ; 


TTTTAAAAGA 


(SEQ 


ID 


N0:270) ; 


TTAAAAGAAA 


(SEQ 


ID 


NO: 272) ; 


AAAAGAAAAG 


(SEQ 


ID 


NO: 274) ; 


AAGAAAAGGG 


(SEQ 


ID 


NO:276) ; 


GAAAAGGGGG 


(SEQ 


ID 


NO: 278) ; 


AAAGGGGGGA 


(SEQ 


ID 


NO:2S0) ; 


AGGGGGGACT 


(SEQ 


ID 


NO: 282) ; 


GGGGGACTGG 


(SEQ 


ID 


N0:284) ; 


GGGACTGGAA 


(SEQ 


ID 


NO: 286) ; 


GACTGGAAGG 


(SEQ 


ID 


NO: 288) ; 


CTGGAAGGGC 


(SEQ 


ID 


N0:290) ; 


GGAAGGGCTA 


(SEQ 


ID 


N0:292) ; 


AAGGGCTAAT 


(SEQ 


ID 


N0:294) ; 


GGGCTAATTC 


(SEQ 


ID 


N0:296) ; 


GCTAATTCAC 


(SEQ 


ID 


NO:298) ; 


TAATTCACTC 


(SEQ 


ID 


NO:300) ; 


ATTCACTCCC 


(SEQ 


ID 


NO:302) ; 


TCACTCCCAA 




TD 


NO* 3 04) i 






TH 
XL/ 


wn • *x ne \ • 






Tn 
xu 


ciw : j us / ; 








NO -3 10) • 

Vt\J m Jlvj y 


AAAGAAGACA 


(SEO 


ID 


NO«312) * 


AGAAGACAAG 


(SEQ 


ID 


NO : 3 14 ) : 


AAGACAAGAT 


(SEQ 


ID 


NO:316) ; 


GACAAGAT AT 


(SEQ 


ID 


NO: 318) ; 


CAAGATATCC 


(SEQ 


ID 


NO: 320); 


AGATATCCTT 


(SEQ 


ID 


NO: 322) ; 


ATATCCTTGA 


(SEQ 


ID 


NO:324) ; 


ATCCTTGATC 


(SEO 


ID 


HO : 326) t 


CCTTGATCTG 


(SEQ 


ID 


NO: 328) ; 


TTGATCTGTG 


(SEQ 


ID 


NO:330) ; 


QATCTGTGGA 


(SEQ 


ID 


NO: 332) ; 


TCTGTGGATC 


(SEQ 


ID 


NO: 334) ; 


TOTGOATCTA 


(SEQ 


ID 


NO: 336) ; 


TGK3ATCTACC 


(SEQ 


ID 


NO: 338) ; 
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CCACTTTTTA 


(SEQ 


ID 


NO: 265) ; 


ACTTTTTAAA 


(SEQ 


ID 


NO: 267) ; 


TTTTTAAAAG 


(SEQ 


ID 


NO: 269) ; 


TTTAAAAGAA 


(SEQ 


ID 


NO: 271) ; 


TAAAAGAAAA 


(SEQ 


ID 


NO:273) ; 


AAAGAAAAGG 


(SEQ 


ID 


NO:275) ; 


AGAAAAGGGG 


(SEQ 


ID 


NO: 277) ; 


AAAAGGGGGG 


(SEQ 


ID 


NO: 279) ; 


AAGGGGGGAC 


(SEQ 


ID 


NO: 281) ; 


GGGGGGACTG 


(SEQ 


ID 


NO: 283) ; 


GGGGACTGGA 


(SEQ 


ID 


NO: 285) ; 


GGACTGGAAG 


(SEQ 


ID 


NO: 287) ; 


ACTGGAAGGG 


(SEQ- 


ID 


NO:289) ; 


TGGAAGGGCT 


(SEQ 


ID 


NO:291) ; 


GAAGGGCTAA 


(SEQ 


ID 


NO: 293) ; 


AGGGCTAATT 


(SEQ 


ID 


N0:295); 


GGCTAATTCA 


(SEQ 


ID 


NO: 297) ; 


CTAATTCACT 


(SEQ 


ID 


NO: 299) ; 


AATTCACTCC 


(SEQ 


ID 


NO:301) ; 


TTCACTCCCA 


(SEQ 


ID 


N0:303) ; 


CACTCCCAAA 


(SEQ 


ID 


NO: 305) ; 


CTCCCAAAGA 


(SEQ 


ID 


NO:307) ; 




(SEO 


ID 


NO: 309) ; 


CAAAOAAGAC 


(SEQ 


ID 


NO: 311) ; 


AAGAAGACAA 


(SEQ 


ID 


NO:313) ; 


GAAGACAAGA 


(SEQ 


ID 


NO:315) ; 


AGACAAOATA 


(SEQ 


ID 


NO:317) ; 


ACAAGATATC 


(SEQ 


ID 


NO:319) ; 


AAGATATCCT 


(SEQ 


ID 


NO: 321) ; 


GATATCCTTG 


(SEQ 


ID 


NO: 323) ; 


TATCCTTGAT 


(SEQ 


ID 


NO:325) ; 


TCCTTGATCT 


(SEQ 


ID 


NO: 327) ; 


CTTGATCTGT 


(SEQ 


ID 


NO:329) ; 


TGATCTGTGG 


(SEQ 


ID 


NO:331) ; 


ATCTGTG6AT 


(SEQ 


ID 


NO: 333) ; 


CTQT6GATCT 


(SEQ 


ID 


NO:335) ; 


GTG6ATCTAC 


(SEQ 


ID 


NO:337) ; 


GGATCTACCA 


(SEQ 


ID 


NO:339) ; 
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GATCTACCAC (SEQ ID NO: 340) ; 
TCTACCACAC (SEQ ID NO: 342); 
TACCACACAC { SEQ ID NO: 344) ; 
CCACACACAA (SEQ ID NO: 346) ; 
ACACACAAGG (SEQ ID NO: 346) ; 
ACACAAGGCT (SEQ ID NO: 350) ; 
ACAAGGCTAC (SEQ ID NO: 352) ; 
AAGGCTACTT (SEQ ID NO: 354) ; 
6GCTACTTCC (SEQ ID NO: 356); 
CTACTTCCCT (SEQ ID NO: 358) ; 
ACTTCCCTGA (SEQ ID NO: 360); 
TTCCCTGATT (SEQ ID NO: 362} ; 
CCCT6ATTG6 (SEQ ID NO: 3 64); 
CTGATT66CA (SEQ ID NO:366) ; 
GATTGGCAGA (SEQ ID NO:368) ; 
TTGGCAGAAC (SEQ ID NO: 370); 
GGCAGAACTA (SEQ ID NO: 372) ; 
CAGAACTACA (SEQ ID NO: 374) ; 
6AACTACACA (SEQ ID NO: 376) ; 
ACTACACACC (SEQ ID NO: 378) ; 
TACACACCAG (SEQ ID NO: 380); 
CACACCAGGG (SEQ ID NO: 382) ; 
CACCAGGGCC (SEQ ID NO: 3 84) ; 
CCAGGGCCA6 (SEQ ID NO:386); 
AGGGCCAGGG (SEQ ID NO:388) ; 
GGCCAGGGGT (SEQ ID NO:390) ; 
CCAGGGGTCA (SEQ ID NO: 332) ; 
AGGGGTCAGA (SEQ ID NO: 394) ; 
GGGTCAGATA (SEQ ID NO: 396) ; 
GTCAGATATC (SEQ ID NO: 398) ; 
CAGATATCCA (SEQ ID NO: 400); 
GATATCCACT (SEQ ID NO: 4 02) ; 
TATCCACTGA (SEQ ID NO: 404) ; 
TCCACTGACC (SEQ ID NO: 406) ; 
CACTGACCTT (SEQ ID NO: 408) ; 
CTGACCTTTG (SEQ ID NO: 410) ; 
GACCTTXGGA (SEQ ID NO: 412) ; 
CCTTTGGATG (SEQ ID NO:414); 
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ATCTACCACA (SEQ ID NO: 341) ; 

CTACCAGACA (SEQ ID NO: 343) ; 
• ACCACACACA (SEQ ID NO:345) ; 

CACACACAAG (SEQ ID NO: 347) ; 

CACACAAGGC (SEQ ID NO: 349) ; 

CACAAGGCTA (SEQ ID NO:351); 

CAAGGCTACT (SEQ ID NO:353); 

AGGCTACTTC (SEQ ID NO: 355) ; 

GCTACTTCCC (SEQ ID NO: 3 57) ; 

TACTTCCCTG (SEQ ID NO: 359) ; 

CTTCCCTGAT (SEQ ID NO: 361); 

TCCCTGATTG (SEQ ID NO:363); 

CCTGATTGGC (SEQ- ID NO: 3 65) ; 
TGATTGGCAG (SEQ ID NO:367); 
ATTGGCAGAA (SEQ ID NO.-369) ; 
TGGCAGAACT (SEQ ID NO: 371); 
GCAGAACTAC (SEQ ID NO:373); 
AGAACTACAC (SEQ ID NO: 375) ; 
AACTACACAC (SEQ ID NO:377) ; 
CTACACACCA (SEQ ID NO: 379) ; 
ACACACCAGG (SEQ ID NO: 3 81) ; 
ACACCAGGGC (SEQ ID NO: 3 83) ; 
ACCAGGGCCA (SEQ ID NO: 3 85) ; 
CAGGGCCAGG (SEQ ID NO:387) ; 
GGGCCAGGGG (SEQ ID NO:389) ; 
GCCAGGGGTC (SEQ ID NO: 3 91) ; 
CAGGGGTCAG (SEQ ID NO:393); 
GGGGTCAGAT (SEQ ID NO: 395) ; 
GGTCAGATAT (SEQ ID NO: 3 97) ; 
TCAGATATCC (SEQ ID NO: 3 99) ; 
AGATATCCAC (SEQ ID NO: 401) ; 
ATATCCACTG (SEQ ID NO:403) ; 
ATCCACTGAC (SEQ ID NO: 4 05) ; 
CCACTGACCT (SEQ ID NO:407) ; 
ACTGACCTTT (SEQ ID NO: 409) ; 
TGACCTTTGG (SEQ ID NO:411) ; 
ACCTTTGGAT (SEQ ID N0:413); 
CTTTGGATGG (SEQ ID NO: 4 15) ; 
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TTTGGATGGT 


(SEQ 


ID 


NO:416) ; 


TGGATGGTGC 


(SEQ 


ID 


NO: 418) ; 


GATGGTGCTA 


(SEQ 


ID 


NO: 420) ; 


TGGTGCTACA 


(SEQ 


ID 


NO: 422) ; 


GTGCTACAAG 


(SEQ 


ID 


NO: 424) ; 


GCTACAAGCT 


(SEQ 


ID 


NO: 426) ; 


TACAAGCTAG 


(SEQ 


ID 


NO:428) ; 


CAAGCTAGTA 


(SEQ 


ID 


NO: 430) ; 


AGCTAGTACC 


(SEQ 


ID 


NO: 432) ; 


CTAGTACCAG 


(SEQ 


ID 


NO: 4 34) ; 


AGTACCAGTT 


(SEQ 


ID 


NO:436) ; 


TACCAGTTGA 


(SEQ 


ID 


NO:438) ; 


CCAGTTGAGC 


(SEQ 


ID 


NO:440) ; 


AGTTGAGCCA 


(SEQ 


ID 


NO: 442) ; 


TTGAGCCAGA 


(SEO 


ID 


NO: 444) ; 


GAGCCAGATA 


(SEQ 


ID 


NO:446) ; 


GCCAGATAAG 


(SEQ 


ID 


NO:448) ; 


CAGATAAGGT 


(SEQ 


ID 


NO:450) ; 


VJUnlitivJulilUt 


\ SDy 


ID 




TAAGGTArcAU 




xu 








xu 




GTAGAAGAGG 


(SEQ 


ID 


NO«45d) : 


AGAAGAGGCC 


(SEQ 


ID 


NO:460) ; 


AAGAGGCCAA 


(SEQ 


ID 


NO:462) ; 


GAGGCCAATA 


(SEQ 


ID 


NO:464) ; 


GGCCAATAAA 


(SEQ 


ID 


NO:466) ; 


CCAATAAAGG 


(SEQ 


ID 


NO:468) ; 


AATAAAGGAG 


(SEQ 


ID 


NO;470) ; 


TAAAGGAGAG 


(SEQ 


ID 


NO: 472) ; 


AAGGAGAGAA 


(SEQ 


ID 


NO:474) ; 


GGAGAGAACA 


(SEQ 


ID 


NO:476) ; 






TT"l 
2.XJ 


IMw • ti / O J f 


AGAACACCAG 


(SEQ 


ID 


NO: 480) ; 


AACACCAGCT 


(SEQ 


ID 


NO:482); 


CACCAGCTTQ 


(SEQ 


ID 


NO:484); 


CCAGCTTGTT 


(SEQ 


ID 


NO:486) ; 


AGCTTGTTAC 


(SEQ 


ID 


NO:488) ; 


CTTGTTACAC 


(SEQ 


ID 


NO: 490) ; 
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TTGGATGGTG 


(SEQ 


ID 


NO: 417) ; 


GGATGGTGCT 


(SEQ 


ID 


NO: 419) ; 


ATGGTGCTAC 


(SEQ 


ID 


NO:421) ; 


GGTGCTACAA 


(SEQ 


ID 


NO: 423) ; 


TGCTACAAGC 


(SEQ 


ID 


NO: 425) ; 


CTACAAGCTA 


(SEQ 


ID 


NO: 427) ; 


ACAAGCTA6T 


(SEQ 


ID 


NO: 429) ; 


AAQCTAGTAC 


(SEQ 


ID 


NO: 431) ; 


GCTAGTACCA 


(SEQ 


ID 


NO: 433) ; 


TAGTACCAGT 


(SEQ 


ID 


NO: 435) ; 


GTACCAGTTG 


(SEQ 


ID 


NO:437) ; 


ACCAGTTGAG 


(SEQ 


ID 


NO:439) ; 


CAGTTGAGCC 


(SEQ 


ID 


N0:441) ; 


GTTGAGCCAG 


(SEQ 


ID 


NO: 443) ; 


TGAGCCAGAT 


(SEQ 


ID 


NO: 445) ; 


AGCCAGATAA 


(SEQ 


ID 


NO: 447) ; 


CCAGATAAGG 


(SEQ 


ID 


NO:449) ; 


AGATAAGGTA 


(SEQ 


ID 


NO:451) ; 


*TA 2VGGTAGA 




ID 


NO* 453) : 






TD 
Xy 


X« W • TB 3 «3 / f 


GfTTA GA UGAG 






WO«457) * 


TAGAAGAGGC 


(SEO 


ID 


N0:459) ; 


GAAGAGGCCA 


(SEQ 


ID 


NO:461) ; 


AGAGGCCAAT 


(SEQ 


ID 


N0.463) ; 


AGGCCAATAA 


(SEQ 


ID 


NO:465) ; 


GCCAATAAAG 


(SEQ 


ID 


N0:467) ; 


CAATAAAGGA 


(SEQ 


ID 


NO:469) ; 


ATAAAGGAGA 


(SEQ 


ID 


NO:471) ; 


AAAGGAGAGA 


(SEQ 


ID 


NO:473) ; 


AGGAGAGAAC 


(SEQ 


ID 


N0:475) ; 


GAGAGAACAC 


(SEQ 


ID 


NO:477) ; 










GAACACCAGC 


(SEQ 


ID 


NO: 481) ; 


ACACCAGCTT 


(SEQ 


ID 


NO:483) ; 


ACCAGCTTGT 


(SEQ 


ID 


NO:485) ; 


CAGCTTGTTA 


(SEQ 


ID 


NO:487) ; 


GCTTGTTACA 


(SEQ 


ID 


NO: 489) ; 


TTGTTACACC 


(SEQ 


ID 


NO: 491) j 
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TGTTACACCC (SEQ ID NO: 492) / 
TTACACCCTG (SEQ ID NO: 494) ; 
ACACCCTGTG (SEQ ID NO:496); 
ACCCTGTGAG (SEQ ID NO: 498) ; 
CCK3TGAGCC (SEQ ID NO:500); 
TGTGAGCCTG (SEQ ID NO:502) ; 
TGAGCCTGCA (SEQ ID NO: 504) ; 
AGCCTGCATG (SEQ ID NO: 506) ; 
CCTGCATGGA (SEQ ID NO: 508) ; 
TGCATGGAAT (SEQ ID NO: 510) ; 
CATGGAATGG (SEQ ID NO: 512) ; 
TGGAATGGAT (SEQ ID NO: 514) ; 
GAATGGATGA (SEQ ID NO: 516) ; 
ATGQATGACC (SEQ ID NO: 518) ; 
GGATGACCCT (SEQ ID NO: 520); 
ATGACCCTQA (SEQ ID NO: 522); 
GACCCTGAGA (SEQ ID NO: 524) ; 
CCCTGAGAGA (SEQ ID NO: 526) ; 
CTGAGAGAGA (SEQ ID NO:528); 
GA6AGAGAAG (SEQ ID NO: 530) ; 
GAGAGAAGTG (SEQ ID NO:532); 
GAGAAGTGTT (SEQ ID NO: 534) ; 
GAAGTGTTAG (SEQ ID NO: 536); 
AGTGTTAGAQ (SEQ ID NO: 538) ; 
TGTTAGAGTG (SEQ ID NO: 540) ; 
TTAGAGTGGA (SEQ ID NO: 542); 
A6A6TGGAG6 (SEQ ID NO:544); 
AGTGGAGGTT- (SEQ ID NO:546) ; 
TGGAGGfTTTG (SEQ ID NO:548) ; 
QAGGTTTGAC (SEQ ID NO: 550); 
GGTTTGACAG (SEQ ID NO: 552); 
TTTGACAGCC (SEQ ID NO: 554) ; 
TGACAGCCGC (SEQ ID NO:556); 
ACAGCCGCCT (SEQ ID NO: 558); 
AGCCGCCTAG (SEQ ID NO: 560) ; 
CCGCCTAGCA (SEQ ID NO:562); 
GCCTAGCATT (SEQ ID NO: 564) ; 
CTAGCATTTC (SEQ ID NO: 566); 
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QTTACACCCT (SEQ ID NO: 4 93) ; 
TACACCCTGT (SEQ ID NO: 4 95) ; 
CACCCTGTGA (SEQ ID NO: 497) ; 
CCCTGTGAGC (SEQ ID NO--499) ; 
CTGTGAGCCT (SEQ ID N0:501); 
GTGAGCCTGC (SEQ ID NO:503) ; 
GAGCCTGCAT (SEQ ID NO: 5 05) ; 
GCCTGCATGG (SEQ ID NO: 507) ; 
CTGCATGGAA (SEQ ID NO: 509) ; 
GCATGGAATG (SEQ ID NO: 511) ; 
ATGGAATGOA (SEQ ID NO: 513) ; 
GGAATGGATG (SEQ ID NO: 515) ; 
AATGGATGAC (SEQ- ID NO: 517) ; 
TGGATGACCC (SEQ ID N0:519); 
GATGACCCTG (SEQ ID NO : 521); 
TGACCCTGAG (SEQ ID NO: 523) ; 
ACCCTGAGAG (SEQ ID NO:525); 
CCTGAGAGAG (SEQ ID NO: 527) ; 
TGAGAGAGAA (SEQ ID NO: 529) ; 
AGAGAGAAGT (SEQ ID NO: 531); 
AGAGAAGTGT (SEQ ID NO; 533) ; 
AGAAGTGTTA (SEQ ID NO:535); 
AAGTGTTAGA (SEQ ID NO: 537); 
GTGTTAGAGT (SEQ ID NO:539); 
GTTAGAGTGG (SEQ ID NO:541); 
TAGAGTGGAG (SEQ ID NO: 543); 
GAGTGGAGGT (SEQ ID NO: 545); 
GTGGAGGTTT (SEQ ID NO: 547) ; 
GGAGGTTTGA (SEQ ID NO: 549) ; 
AGGTTTGACA (SEQ ID NO:551); 
GTTTGACAGC (SEQ JD NO: 553) ; 
TTGACAGCCG (SEQ ID NO: 555) ; 
GACAGCCGCC (SEQ ID NO: 557) ; 
CAGCOGCCTA (SEQ ID NO: 559) ; 
GOOGCCTAGC (SEQ ID NO: 561) ; 
CGCCTAGCAT (SEQ ID NO:563); 
CCTAGCATTT (SEQ ID NO: 565) ; 
TAGCATTTCA (SEQ ID NO: 567); 
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AGCATTTCAT 


(SEQ 


ID NO: 568) 


CATTTCATCA 


(SEQ 


ID NO: 570) 


TTTCATCACG 


(SEQ 


ID NO: 572) 


TCATCACGTQ 


(SEQ 


ID NO: 574) 


ATCACGTGGC 


(SEQ 


ID NO: 576) 


CACGTGGCCC 


(SEQ 


ID NO:578) 


CGTGGCCCGA 


(SEQ 


ID NO: 580) 


TGGCCCGAGA 


(SEQ 


ID NO: 582} 


GCCCGAGAGC 


(SEQ 


ID NO: 5 84) 


CCGAGAGCTG 


(SEQ 


ID NO: 586) , 


GAGAGCTGCA 


(SEQ 


ID NO: 588) ; 


GAGCTGCATC 


(SEQ 


ID NO: 590) ; 


GCTGCATCCG 


(SEQ 


ID NO: 592) ; 


TGCATCCGGA 


(SEQ 


ID NO: 594) : 


CATCCGGAGT 


(SEQ 


ID NO: 596) ; 


TCCG6A6TAC 


(SEQ 


ID NO: 598) ; 


CGGAGTACTT 


(SEQ 


ID NO: 600) ; 


GAGTACTTCA 


(SEQ 


ID NO: 602) ; 


OTACTTCAAG 


(SEQ 


ID NO: 604) ; 


ACTTCAAGAA 


(SEQ 


ID NO: 606) ; 


TTCAAGAACT 


(SEQ 


ID NO: 608) ; 


CAAGAACTGC 


(SEQ 


ID NO: 610) ; 


AGAACTGCTG 


(SEQ 


ID NO: 612) ; 



GCATTTCATC (SEQ ID NO: 569) / 

ATTTCATCAC (SEQ ID NO: 571); 

TTCATCACGT (SEQ ID NO:573); 

CATCACGTGG (SEQ ID NO: 575) ; 

TCACGTGGCC (SEQ ID NO: 577); 

ACGTGGCCCG (SEQ ID NO: 579); 

GTGGCCCGAG (SEQ ID NO: 581); 

GGCCCGAGAG (SEQ ID NO:583); 

CCCGAGAGCT (SEQ ID NO: 585) ; 

CGAGAGCTGC (SEQ ID NO: 587) ; 
AGAGCTGCAT (SEQ ID NO: 589) ; 
AGCTGCATCC (SEQ ID NO: 591) ; 
CTGCATCCGG (SEQ ID NO: 593) ; 
GCATCCGGAG (SEQ ID NO: 595) ; 
ATCCGGAGTA (SEQ ID NO:597); 
CCGGA6TACT (SEQ ID NO: 599) ; 
GGAGTACTTC (SEQ ID NO: 6 01) ; 
AGTACTTCAA (SEQ ID NO: 6 03) ; 
TACTTCAAGA (SEQ ID NO: 605) ; 
CTTCAAGAAC (SEQ ID NO: 607); 
TCAAGAACTG (SEQ ID NO: 609) ; 
AAGAACTGCT (SEQ ID NO:611); 
GAACTGCTGA (SEQ ID NO: 613) . 



49. A method according to claim 44 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HTV-1 strain NL4-3: 
(SEQ ID NO: 652) ; 
(SEQ ID NO: 654) ; 
(SEQ ID NO: 656) ; 
(SEQ ID NO: 658) ; 
(SEQ ID NO: 660) ; 
(SEQ ID NO: 662); 
(SEQ ID NO: 664) ; 
(SEQ ID NO: 666) ; 
(SEQ ID NO: 668} ; 
(SEQ ID NO: 670) ; 
(SEQ ID NO: 672) ; 



GCTTTTTGCC 
TTTTTGCCTG 
TTTGCCTGTA 
TGCCTGTACT 
CCTGTACTGG 
TGTACTGGGT 
TACTGGGTCT 
CTGGGTCTCT 
GGGTCTCTCT 
GTCTCTCTGG 
CTCTCTGGTT 



CTTTTTGCCT 


(SEQ 


ID 


NO: 653) ; 


TTTTGCCTGT 


(SEQ 


ID 


NO: 655) ; 


TTGCCTGTAC 


(SEQ 


ID 


NO: 657) ; 


GCCTGTACTG 


(SEQ 


ID 


NO: 659) ; 


CTGTACTGGG 


(SEQ 


ID 


NO: 661) ; 


GTACTGGGTC 


(SEQ 


ID 


NO: 663) ; 


ACTGGGTCTC 


(SEQ 


ID 


NO:665) ; 


TGGGTCTCTC 


(SEQ 


ID 


NO: 6 67) ; 


GGTCTCTCTG 


(SEQ 


ID 


NO:669) ; 


TCTCTCTGGT 


(SEQ 


ID 


NO: 671) ; 


TCTCTGGTTA 


(SEQ 


ID 


NO: 673) ; 
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CTCTGGTTAG (SBQ ID NO: 674) ; TCTCTGGTTA (SEQ ID NO: 675) ; 
CTGGTTAGAC (SEQ ID NO: 676) ; TGGTTAGACC {SEQ ID NO: 677) ; 
GGTTAGACCA (SEQ ID NO: 678) ; GTTAGACCAG (SEQ ID NO: 6791) ; 
TTAGACCAGA (SEQ ID NO: 680) ; TAGACCAGAT (SEQ ID NO: 681) ; 
AGACCAGATC (SEQ ID NO: 682) ; GACCAGATCT (SEQ ID NO: 683) ; 
ACCAGATCTG (SEQ ID NO: 684); CCAGATCTGA (SEQ ID NO: 685) ; 

CAGATCTGAG (SEQ ID NO: 686) ; AGATCTGAGC (SEQ ID NO: 687) ; 

GATCTGAGCC (SEQ ID NO:688) ; ATCTGAGCCT (SEQ ID NO: 689) ; 

TCTGAGCCTG (SEQ ID NO: 690) ; CTGAGCCTGG (SEQ ID NO: 691); 

TGAGCCTGGG (SEQ ID NO: 692) ; GAGCCTGGGA (SEQ ID NO: 693); 

AGCCTGGGAG (SEQ ID NO:694); GCCTGGGAGC (SEQ ID NO:695); 

CCTGGGAGCT (SEQ ID NO: 696) ; CTGGGAGCTC (SEQ ID NO:697) ; 

TGGGAGCTCT {SEQ ID NO: 698) ; GGGAGCTCTC {SEQ ID NO: 699) ; 

GGAGCTCTCT (SEQ ID NO: 700) ; GAGCTCTCTG (SEQ ID NO: 701) ; 

AGCTCTCTGG (SEQ ID NO: 702); GCTCTCTGGC (SEQ ID NO: 703) ; 

CTCTCTGGCT (SEQ ID NO: 704) ; TCTCTGGCTA (SEQ ID NO: 705) ; 

CTCTGGCTAA (SEQ ID NO: 706) ; TCTGGCTAAC (SEQ ID NO: 707) ; 

CTGGCTAACT (SEQ ID NO: 708) ; TGGCTAACTA (SEQ ID NO: 709) ; 

GGCTAACTAG (SEQ ID NO: 710) ; GCTAACTAGG {SEQ ID NO: 711) ; 

CTAACTAGGG (SEQ ID NO: 712);- TAACTAGGGA (SEQ ID NO:713); 

AACTAGGGAA (SEQ ID NO:714); ACEAGGGAAC (SEQ ID NO:715); 

CTAGGGAACC (SEQ ID NO: 716) ; TAGGGAACCC (SEQ ID NO: 717) ; 

AGGGAACCGA (SEQ ID NO: 718) ; GGGAACCCAC (SEQ ID NO: 719) ; 

GGAACCCACT (SEQ ID NO: 720) ; GAACCCACTG (SEQ ID NO: 721) ; 

AACCCACTGC (SEQ ID NO: 722); ACCCACTGCT (SEQ ID NO: 723) ; 

CCCACTGCTT (SEQ ID NO: 724); CCACTGCTTA (SEQ ID NO: 725) ; 

CACTGCTTAA (SBQ ID NO: 726); ACTGCTTAAG (SEQ ID NO : 727); 

CTGCTTAAGC (SEQ ID NO: 728); TGCTTAAGCC (SEQ ID NO:729); 

GCTTAAGCCT (SEQ ID NO: 730) ; CTTAAGCCTC (SEQ ID NO: 731) ; 

TTAAGCCTCA (SEQ ID NO: 732) ; TAAGCCTCAA (SEQ ID NO:733); 

AAGCCTCAAT (SEQ ID NO: 734); AGCCTCAATA (SEQ ID NO: 735) ; 

GCCTCAATAA (SEQ ID NO: 736) ; CCTCAATAAA (SEQ ID NO:737); 

CTCAATAAAG (SEQ ID NO: 738) ; TCAATAAAGC (SEQ ID 110:739); 

CAATAAAGCT (SEQ ID NO: 740) ; AATAAAGCTT (SEQ ID NO: 741) ; 

ATAAAGCTTG (SEQ ID NO: 742); TAAAGCTTGC (SEQ ID NO: 743); 

AAAGCTTGCC (SEQ ID N0:744); AAGCTTGCCT (SEQ ID NO: 745) ; 

AGCTTGCCTT (SEQ ID NO: 746); GCTTGCCTTG (SEQ ID NO: 747); 

CTTGCCTTGA (SEQ ID NO: 748); TTGCCTTGAG (SEQ ID NO: 749) ; 
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TGCCTTGA6T 

CCTTGAGTGC 

TTOAGTGCTT 

GAGTGCTTCA 

GTGCTTCAAG 

GCTTCAAGTA 

TTCAAGTAGT 

CAAGTAGTGT 

AGTAGTGTGT 

TAGTGTGTGC 

GTGTGTGCCC 

GTGTGCCCGT 

GTGCCCGTCT 

GCCCGTCTGT 

CCGTCTGTTG 

GTCTGTTGTG 

CTGTTGTGTG 

GTTGTGTGAC 

TGTGTGACTC 

TGTGTGACTC 

TGTGACTCTG 

TGACTCTGGT 

ACTCTGGTAA 

TCTGGTAACT 

TGGTAACTAG 



(SEQ ID NO: 750 
(SEQ ID NO: 752 
(SEQ. ID NO: 754 
(SEQ ID NO: 756 
(SEQ ID NO: 758 
(SEQ ID NO: 760 
(SEQ ID NO: 762 
(SEQ ID NO: 764 
(SEQ ID NO: 766 
(SEQ ID NO: 768 
(SEQ ID NO: 770 
(SEQ ID NO: 772 
(SEQ ID NO: 774 
(SEQ ID NO: 776 
(SEQ ID NO: 778 
(SEQ ID NO: 780 
(SEQ ID NO: 782 
(SEQ ID NO: 784 
(SEQ ID NO: 786 
(SEQ ID NO: 788 
(SEQ ID NO: 790 
(SEQ ID NO: 792 
(SEQ ID NO: 794 
(SEQ ID NO: 796 
(SEQ ID NO: 798 



GCCTTGAGTG 

CTTGAGTGCT 

TGAGTGCTTC 

AGTGCTTCAA 

TGCTTCAAGT 

CTTCAAGTAG 

TCAAGTAGTG 

AAGTAGTGTG 

GTAGTGTGTG 

AGTGTGTGCC 

TGTGTGCCCG 

TGTGCCCGTC 

TGCCCGTCTG 

CCCGTCTGTT 

CGTCTGTTGT 

TCTGTTGTGT 

TGTTGTGTGA 

TTGTGTGACT 

GTGTGACTCT 

GTGTGACTCT 

GTGACTCTGG 

GACTCTGGTA 

CTCTGGTAAC 

CTGGTAACTA 

GGTAACTAGA 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 751) 
NO: 753) 
NO: 755) 
NO: 757) 
NO: 759) 
NO: 761) 
NO: 763) 
NO: 765) 
NO: 767) 
NO: 769) 
NO: 771) 
NO: 773) 
NO: 775) 
NO:777) 
NO: 779) 
NO: 781) 
NO: 783) 
NO: 785) 
NO: 787) 
NO: 789) 
NO: 791) 
NO:793) 
NO: 795) ; 
NO: 797) ; 
NO: 799) . 



SO. A method according to claim 24 wherein said HTV-1 isolate has the identifying 
characteristics of HTV-1 isolate ECACC V94101706 or HTV-1 isolate ECACC 
V941031169or isolate C54 deposited at ECACC on 14 February, 1995 under Accession 
Number . 



51. A method according to claim 20 wherein the HTV-1 isolate is reactive to 
antibodies to a glycoprotein of HTV-1, is capable of inducing an immune response of 
gag, pol and/or em and which is incapable of directing synthesis of a nef gene product 
or a full length nef gene product 
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52. A method according to claim 51 wherein the glycoprotein is at least one of gp41- 
45, gpl20 and/or gpl60. 

53. A method for obtaining a pr ep arat ion of non-pathogenic HTV-1 from a biological 
sample, said method comprising co-culturing PBMCs from said biological sample from 
an individual putatively infected with said non-pathogenic HTV-1 with HTV-1 
seronegative donor PBMCs depleted for CD8+ cells, harvesting the PBMCs and 
supernatant fluid every from about 5 to about 10 days and adding fresh medium with 
CD8+ depleted PBMCs with said fresh medium and isolating said virus from the 
supernatant fluid 

54. A method according to claim 53 wherein the smmegative PBMCs are pretreated 
with a cytokine for a time and under conditions sufficient to enhance HIV- 1 replication. 

55. A method according to claim 54 wherein the cytokine is M-CSF. 

56. A method according to claim 53 or 54 or 55 wherein the HTV-1 seronegative 
donor PBMCs are first stimulated with PHA and/or IL-2. 

57. A method according to claim S3 wherein the PBMCs from a putative infected 
individual are also CD8+ depleted prior to coculturing with seronegative donor PBMCs. 

58. A method according to claim 53 or 57 wherein the cocultured cells are subjected 
to UV irradiation. 

59. A method for obtaining a preparation of non-pathogenic HTV-1 from a biological 
sample, said method comprising co-culturing monocytes from said biological sample 
from an individual putatively infected with said non-pathogenic HIV-1 with HTV-1 
seronegative donor PBMCs depleted for CD8+ cells, harvesting the monocytes and 
PBMCs and supernatant fluid every from about 5 to about 10 days and adding fresh 
medium with CD8+ depleted PBMCs with said fresh medium and isolating said virus 
from the supernatant fluid 
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60. A method according to claim 59 wherein the seronegative PBMCs are pretreated 
with a cytokine for a time and under conditions sufficient to enhance HTV-1 replication. 

61. A method according to claim 60 wherein the cytokine is M-CSF. 

62. A method according to claim 59 or 60 or 61 wherein the HTV-1 seronegative 
donor PBMCs are first stimulated with PHA and/or IL-2. 

63. A method according to rfaim 59 wherein the monocytes from a putative infected 
individual are also CD8+ depleted prior to coculturing with seronegative donor PBMCs. 

64. A method according to claim 59 or 63 wherein the cocultured cells are subjected 
to UV irradiation. 

65. A method for identifying or screening for compounds capable of reducing or 
otherwise interfering with HIV-1 replication, said method comprising contacting a 
compound to be tested with a cell or cell extract containing or capable of containing a 
nef gene product fused to a reporter molecule capable of giving an identifiable signal 
and screening for a compound which inhibits said signal. 

66. A method according to claim 65 wherein the compound is an antibody to nef 
gene product or a part thereof. 

67. A method according to claim 65 wherein die compound is a DNA targeting agent 
and inhibits transcription of the nef gene. 

68. A compound capable of inhibiting nef gene product function or nef gene 
transcription. 

69. A compound capable of inhibiting nef gene product function or nef gene 
transcription identified according to the method of claim 65 or 66 or 67. 
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70. A therapeutic composition useful for inhibiting or reducing productive infection 
by a pathogenic strain of HTV-1 and/or for vaccinating an individual against the 
development of AIDS or AIDS-related diseases, said composition comprising a non- 
pathogenic strain of HIV-1 according to any one of claims 1 to 16 and one or more 
pharmaceutical acceptable carriers and/or diluents. 

71. A therapeutic composition according to claim 70 wherein said HTV-1 isolate 
comprises genetic material that directs expression of antisense or ribozyme nucleotide 
sequences which inhibit production of one or more proteins encoded by a pathogenic 
strain of HTV-1. 

72. A viral isolate which: 

(i) is genetically or immunologically related to a pathogenic strain of HTV-1; 

(ii) is substantially non-pathogenic in human subjects; 

(iii) comprises a first nucleotide sequence constituting its genome which is 
capable of hybridising under medium stringency conditions to SEQ ID 
NO: 1 or a complementary form thereof; and 

(iv) comprises a second nucleotide sequence within said first nucleotide 
sequence and which second nucleotide sequence directs expression of a 
mRNA molecule capable of inhibiting, reducing or otherwise down- 
regulating translation of a protein or polypeptide encoded by a pathogenic 
strain of HIV-1 or inhibit, reduce or otherwise down regulate operation 
of a non-protein encoding a region of a pathogenic strain of HTV-1. 

73. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of gag or poL 

74. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of env or tat. 

75. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the Est consisting of rev or vpu. 
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76. A viral isolate according to claim 72 wherein the targeted protein or polypeptide 
is selected from the list consisting of vpr, w/ or nef. 

77* A viral isolate according to claim 72 wherein the targeted protein is nef 

78. A method for determining the pathogenicity of an HTVM strain after said HIV-1 
strain infects cells of an individual, said method comprising determining the presence 
of a deletion mutation in the genome of said HIV-1 wherein said deletion mutation 
results in said genome being unable to synthesise a polypeptide or protein from a 
pathogenic strain of HIV-1 or directing the synthesis of a truncated form of said 
polypeptide or protein wherein the presence of a such a mutation is indicative of the 
presence of a non-pathogenic strain of HIV-1 . 

79. The method according to claim 78 wherein said non-pathogenic HTV-1 carries 
a deletion in its genome of at least 10 nucleotides within the region from nucleotide 
8787 to nucleotide 9709 using the nucleotide numbering of HIV-1 strain NL4-3. 

80. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 

nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iil) 9019-9029; and 

Gv) 9033-9049. 

81. A strain of HIV-1 according to claim 79 wherein said non-pathogenic HIV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of. 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vu) 9105-9224; and 

(viii) 9271-9370. 
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82. A strain of HTV-1 according to claim 79 wherein said non-pathogenic HTV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 
nucleotide fix) 8882-8928: 



(ix) 

W 
(xi) 
(xii) 



8850-9006 
8792-9041 
9112-9204. 



and 



83. A strain of HTV-I according to claim 79 wherein said non-pathogenic HTV-1 
carries a deletion in its genome of at least 10 nucleotides from within a region selected 
from the list consisting of: 
nucleotide (xiii) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 



84. A strain 

least one of the 

ATGGGTGGCA 

GGGTGGCAAG 

GTGGCAAGTG 

GGCAAGTGGT 

CAAGTGGTCA 

AGTGGTCAAA 

TGGTCAAAAA 

GTCAAAAAGT 

CAAAAAGTAG 

AAAAGTAGTG 

AAGTAGTGTG 

GTAGTGTGAT 

AGTGTGATTG 

TGTGATTGGA 

TGATTGGATG 

ATTGGATGGC 

TGGATGGCCT 

GATGGCCTGC 



of HTV-1 according to claim 79 wherein said deletion encompasses at 
following decanucleotides from the nef gene of HTV-1 strain NL4-3: 

TGGGTGGCAA (SEQ ID NO:3) ; 

GGTGGCAAGT (SEQ ID NO: 5) ; 

TGGCAAGTGG (SEQ ID NO:7); 

GCAAGTGGTC (SEQ ID NO: 9); 

AAGTGGTCAA (SEQ ID NO: 11) ; 

GTGGTCAAAA (SEQ ID NO: 13); 

GGTCAAAAAG (SSQ ID NO:15) ; 

TCAAAAAGTA (SEQ ID NO? 17); 

AAAAAGTAGT (SEQ ID NO: 19); 

AAAGTAGTGT (SEQ ID NO: 21); 

AGTAGTGTGA (SEQ ID NO: 23) ; 

TAGTOTGATT (SEQ ID NO: 25); 

6TGT6ATTGG (SEQ ID NO: 27) 

GTGATTG6AT (SEQ ID NO: 29) 

GATTGGATGG (SEQ ID NO: 31) 

TTGGATGGCC (SEQ ID NO:33) 

GGATGGCCTG (SBQ ID NO:35) 

ATGGCCTGCT (SEQ ID NO:37) 



(SEQ 


ID 


NO: 2) ; 


(SEQ 


ID 


NO: 4) ; 


(SEQ 


ID 


NO: 6) ; 


(SEQ 


ID 


NO:8) ; 


(SEQ 


ID 


NO:10) ; 


(SEQ 


ID 


NO:12) ; 


(SEQ 


ID 


NO:14) ; 


(SEQ 


ID 


NO:16> ; 


(SEQ 


ID 


NO:18) ; 


(SEQ 


ID 


NO:20) ; 


(SEQ 


ID 


NO:22) ; 


(SEQ 


ID 


NO: 24) ; 


(SEQ 


ID 


NO:26) ; 


(SEQ 


ID 


NO:28) ; 


(SEQ 


ID 


NO:30) ; 


(SEQ 


ID 


NO:32); 


(SEQ 


ID 


NO:34) ; 


(SEQ 


ID 


NO: 36) ; 
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TGGCCTOCTO (SEQ ID NO: 38); 
GCCTOCTQTA (SEQ ID NO:40); 
CTGCTGTAAG (SEQ ID NO: 42); 
6CT6TAAGGG (SEQ ID NO: 44); 
TGTAAGGGAA (SEQ ID NO.-46); 
TAAGGGAAAG (SEQ ID NO: 48) ; 
AGGGAAAGAA (SEQ ID NO:50); 
GGAAAGAATG (SEQ ID NO: 52); 
AAAGAATGAG (SEQ ID NO: 54) ; 
AGAATGAGAC (SEQ ID NO: 56) ; 
AATGAGACGA (SEQ ID NO: 58); 
TGAGACGAGC (SEQ ID NO: 60); 
AGACGAGCTG (SEQ ID NO: 62) ; 
ACGAGCTGAG (SEQ ID NO: 64) ; 
GAGCTGAGCC (SEQ ID NO: 66) ; 
GCTGAGCCAG (SEQ ID NO: 68); 
TGAGCCAGCA (SEQ ID NO: 70) ; 
AGCCAGCAGC (SEQ ID NO: 72); 
CCAGCAGCAG (SEQ ID NO: 74); 
AGCAGCAGAT (SEQ ID NO: 76) ; 
CAGCAGATGG (SEQ ID NO: 78); 
GCAGATGGGG (SEQ ID NO: 80) ; 
AGATGGGGTG (SEQ ID NO: 82) ; 
ATGGGGTGGG (SEQ ID NO: 84) ; 
GGGGTGGGAG (SEQ ID NO: 86) ; 
GGTGGGAGCA (SEQ ID NO: 88) ; 
TGGGAGCAGT (SEQ ID NO: 90) ; 
GGAGCAGTAT (SEQ ID NO: 92) ; 
AGCAGTATCT (SEQ ID NO: 94); 
CAGTATCTCG (SEQ ID NO: 96); 
GTATCTCGAG (SEQ ID NO: 98) ; 
ATCTCGAGAC (SEQ ID NO:100) ; 
CTCGAGACCT (SEQ ID NO : 102); 
CGAGACCTAG (SEQ ID NO: 104); 
AGACCTAGAA (SEQ ID NO: 106); 
ACCTAGAAAA (SEQ ID NO: 108); 
CTAGAAAAAC (SEQ ID NO: 110) ; 
AGAAAAACAT (SEQ ID NO: 112); 



GGCCTGCTGT 


(SEQ 


ID 


NO: 


39 


CCTGCTGTAA 


(SEQ 


ID 


NO: 


41 


TGCTGTAAGG 


(SEQ 


ID 


NO: 


43 


CTGTAAGGGA 


(SEQ 


ID 


NO: 


45 


GTAAGGGAAA 


(SEQ 


ID 


NO: 


47 


AAGGGAAAGA 


(SEQ 


ID 


NO: 


49 


GGGAAAGAAT 


(SEQ 


ID 


NO: 


51 


GAAAGAATGA 


(SEQ 


ID 


NO: 


53 


AAGAATGAGA 


(SEQ 


ID 


NO: 


55 


GAATGAGACG 


(SEQ 


ID 


NO: 


57 


ATGAGACGAG 


(SEQ 


ID 


NO: 


59 


GAGACGAGCT 


(SEQ 


ID 


NO: 


61 


GACGAGCTGA 


(SEQ 


ID 


NO: 


63] 


CGAGCTGAGC 


(SEQ 


ID 


NO: 


65! 


AGCTGAGCCA 


(SEQ 


ID 


NO: 


67] 


CTQAGCGAGC 


(SEQ 


ID 


NO: 


69] 


GAGCCAGCAG 


(SEQ 


ID 


NO: 


71] 


GCCAGCAGCA 


(SEQ 


ID 


NO: 


73 




(SEO 


ID 


NO: 


75 






ID 


NO* 


77 


AGCAGATGGG 


(SEO 


ID 


NO: 


79! 


CAGATGGGGT 


(SEO 


ID 


NO: 


81! 


GATGGGGTGG 


(SEQ 


ID 


NO: 


83] 


TGGGGTGGGA 


(SEQ 


ID 


NO: 


85] 


GGGTGGGAGC 


(SEQ 


ID 


NO: 


87] 


GTGGGAGCAG 


(SEQ 


ID 


NO: 


89] 


GGGAGCAGTA 


(SEQ 


ID 


NO: 


91] 


GAGCAGTATC 


(SEQ 


ID 


NO: 


93] 


GCAGTATCTC 


(SEQ 


ID 


NO: 


95 


AGTATCTCGA 


(SEQ 


ID 


NO: 


97 


TATCTCGAGA 


(SEQ 


ID 


NO: 


99 


TCTCGAGACC 


(SEQ 


ID 


NO: 101 



TCGAGACCTA (SEQ ID NO: 103) 

GAGACCTAGA (SEQ ID NO:105) 

GACCTAGAAA (SEQ ID NO: 107) 

CCTAGAAAAA (SEQ ID NO: 109) 

TAGAAAAACA (SEQ ID NO: 111) 

GAAAAACATG (SEQ ID NO: 113) 



WO 95/21912 



AAAAACATGG (SEQ ID NO: 114) 
AAACATGGAG (SEQ ID NO: 116) 
ACATGGAGCA (SEQ ID NO: 118) 
ATGGAGCAAT (SEQ ID NO: 120) , 
GGAGCAATCA (SEQ ID NO: 122) ; 
AGCAATCACA (SEQ ID NO: 124) ; 
CAATCACAAG (SEQ ID NO: 126) ; 
ATCACAAGTA (SEQ ID NO: 128); 
CACAAGTAGC (SEQ ID NO:130); 
CAAGTAGCAA (SEQ ID NO: 132) ; 
AGTAGCAATA (SEQ ID NO: 134); 
TAGCAATACA (SEQ ID NO: 136); 
GCAATACAGC (SEQ ID NO:138); 
AATACAGCAG (SEQ ID N0:140); 
TACAGCAGCT (SEQ ID NO: 142); 
CAGCAGCTAA (SEQ ID NO: 144) ; 
GCAGCTAACA (SEQ ID NO: 146); 
AQCTAACAAT (SEQ ID NO: 148) ; 
CTAACAATGC ( SEQ ID NO : 150 ) ; 
AACAATGCTG (SEQ ID NO: 152); 
CAATGCTGCT (SEQ ID NO:154) ; 
ATGCTGCTTG (SEQ ID NO:156); 
GCTGCTTGTG (SEQ ID NO: 158); 
TGCTTGTGCC (SEQ ID NO: 160); 
CTTGTSCCTS (SEQ ID NO:162); 
TGTGCCTGGC (SEQ ID NO:164); 
TGCCTGGCTA (SEQ ID NO: 166) ; 
CCTGGCTAGA (SEQ ID NO: 168) ; 
TGGCTAGAAG (SEQ ID NO: 170); 
GCTAGAAGCA (SEQ ID NO: 172); 
TAGAAGCACA (SEQ ID NO:174); 
GAAGCACAAG (SEQ ID NO: 176} ; 
AGCACAAGAG (SEQ ID NO: 178) ; 
CACAAGAGGA (SEQ ID NO: 180) ; 
CAAGAGGAGG (SEQ ID NO:182); 
AGAGGAGGAA (SEQ ID NO: 184) ; 
AGGAGGAA6A (SEQ ID ND.-186); 
GAGGAAGAGG (SEQ ID NO: 188) ; 
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AAAACATGGA (SEQ ID NO: 115) 
AACATGGAGC (SEQ ID NO: 117) 
CATGGAGCAA (SEQ ID NO: 119) 
TGGAGCAATC (SEQ ID NO: 121) 
GAGCAATCAC (SEQ ID NO: 123) 
GCAATCACAA (SEQ ID NO: 125) 
AATCACAAGT (SEQ ID NO: 127) 
TGACAAGTAG (SEQ ID NO: 129) , 
ACAAGTAGCA (SEQ ID NO: 131) ; 
AAGTAGCAAT (SEQ ID NO: 133) ; 
GTAGCAATAC (SEQ ID NO: 135) ; 
AGCAATACAG (SEQ ID NO: 13 7) ; 
CAATACAGCA (SEQ ID NO: 139) ; 
ATACAGCAGC (SEQ ID NO: 141) ; 
ACAGCAGCTA (SEQ ID NO: 143) ; 
AGCAGCTAAC (SEQ ID NO: 145) ; 
CAGCTAACAA (SEQ ID NO: 147); 
GCTAACAATG (SEQ ID NO:149); 
TAACAATGCT (SEQ ID NO:151); 
ACAATGCTGC (SEQ ID NO:153); 
AATGCTGCTT (SEQ ID NO:155); 
TGCTGCTTGT (SEQ ID NO: 157); 
CTGCTTGTGC (SEQ ID NO: 159) ; 
GCTTGTGCCT (SEQ ID NO:161); 
TTGTGCCTGG (SEQ ID NO: 163) ; 
GTGCCTGGCT (SEQ ID NO:165); 
GCCTGGCTAG (SEQ ID NO: 167); 
CTGGCTAGAA (SEQ ID N0:169); 
GGCTAGAAGC (SEQ ID NO: 171) ; 
CTAGAAGCAC (SEQ ID NO:173); 
AGAAGCACAA (SEQ. ID NO: 175) ; 
AAGCACAAGA (SEQ ID NO: 177) ; 
GCACAAGAGG (SEQ ID NO: 179); 
ACAAGAGGAG (SEQ ID NO: 181) ; 
AAGAGGAGGA (SEQ ID NO: 183) ; 
GAGGAGGAAG, (SEQ ID NO: 185) ; 
GGAGGAAGAG (SEQ ID NO:187); 
AGGAAGAGGT (SEQ ID NO: 189) ; 
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GGAAGAGGTG (SEQ ID NO: 190) ; 
AAGAGGTGGG (SEQ ID NO: 192) ; 
GAGGTOGGTT (SEQ ID NO: 194) ; 
GGTGGGTTTT (SEQ ID NO: 196) ; 
TGGGTTTTCC (SEQ ID NO: 198) ; 
GGTTTTCCAG (SEQ ID NO-.200) ; 
TTTTCCAGTC (SEQ ID NO: 202) ; 
TTCCAGTCAC (SEQ ID NO: 2 04) ; 
CCAGTCACAC (SEQ ID NO: 206) ; 
AGTCACACCT .(SEQ ID NO:208); 
TCACACCTCA (SEQ ID NO: 210) ; 
ACACCTCAGG (SEQ ID NO: 212) ; 
ACCTCAGGTA (SEQ ID NO: 214) ; 
CTCAGGTACC (SEQ ID NO: 216); 
CAGGTACCTT (SEQ ID NO: 218) ; 
GGTACCTTTA (SEQ ID NO: 220} ; 
TACCTTTAAG (SEQ ID NO: 222); 
CCTTTAAGAC (SEQ ID NO: 224) ; 
TTTAAGACCA (SEQ ID NO: 226) ; 
TAAGACCAAT (SEQ ID NO: 22 8) ; 
AGACCAATGA (SEQ ID NO:230) ; 
ACCAATGACT (SEQ ID NO: 232} ; 
CAATGACTTA (SEQ ID NO: 234); 
ATGACTTACA (SEQ ID NO: 23 6); 
GACTTACAAG (SEQ ID NO: 238); 
CTTACAAGGC (SEQ ID NO: 240) ; 
TACAAGGCAG (SEQ ID NO: 242) ; 
CAAGGCAGCT (SEQ ID NO: 244) ; 
AGGCAGCTGT (SEQ ID NO: 246) ; 
GCAGCTGTAG (SEQ ID NO: 248) ; 
AGCTGTAGAT (SEQ ID NO: 250) ; 
CTGTAGATCT (SEQ ID NO: 252) ; 
GTAGATCTTA (SEQ ID NO: 254) ; 
AGATCTTAGC (SEQ ID NO: 25 6) ; 
ATCTTAGCCA (SEQ ID NO: 258); 
CTTAGCCACT (SEQ ID NO: 260); 

: 

TAGCCACTTT (SEQ ID NO: 262) ; 
GCCACTTTTT (SEQ ID NO: 264) ; 
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GAAGAGGTGG 


(SEQ 


ID 


NO:191) ; 


AGAGGTGGGT 


(SEQ 


ID 


NO: 193) ; 


AGGTGGS3TTT 


(SEQ 


ID 


NO: 195) ; 


GTGGGTTTTC 


(SEQ 


ID 


NO: 197) ; 


GGGTTTTCCA 


(SEQ 


ID 


NO: 199) ; 


GTTTTCCAGT 


(SEQ 


ID 


NO: 201) ; 


TTTCCAGTCA 


(SEQ 


ID 


NO: 203) ; 


TCCAGTCACA 


(SEQ 


ID 


NO:205) ; 


CAGTCACACC 


(SEQ 


ID 


NO:207) ; 


GTCACACCTC 


(SEQ 


ID 


NO: 209) ; 


CACACCTCAG 


(SEQ 


ID 


NO: 211) ; 


CACCTCAGGT 


(SEQ 


ID 


NO: 213) ; 


CCTCAGGTAC 


(SEQ 


ID 


NO: 215) ; 


TCAGGT&CCT 


(SEQ 


ID 


NO: 217) ; 


AGGTACCTTT 


(SEQ 


ID 


NO: 219) ; 


GTACCTTTAA 


(SEQ 


ID 


NO: 221) ; 


ACCTTTAAGA 


(SEQ 


ID 


NO: 223) ; 


CTTTAAGACC 


(SEQ 


ID 


NO: 225) ; 


TTAAGACCAA 


(SEQ 


ID 


NO: 227) ; 


AAGACCAATG 


(SEQ 


ID 


NO: 229) ; 


GACCAATGAC 


(SEQ 


ID 


NO: 231) ; 


CCAATGACTT 


(SEQ 


ID 


NO: 233) ; 


AATGACTTAC 


(SEQ 


ID 


NO: 235) ; 


TGACTTACAA 


(SEQ 


ID 


NO: 237} ; 


ACTTACAAGG 


(SEQ 


ID 


NO: 239) ; 


TTACAAGGCA 


(SEQ 


ID 


NO: 241) ; 


ACAAGGCAGC 


(SEQ 


ID 


NO: 243) ; 


AAGGCAGCTG 


(SEQ 


ID 


NO: 245) ; 


GGCAGCTGTA 


(SEQ 


ID 


NO: 247) ; 


CAGCTGTAGA 


(SEQ 


ID 


NO: 249) ; 


GCTGTAGATC 


(SEQ 


ID 


NO:251) ; 


TGTAGATCTT 


(SEQ 


ID 


NO: 253) ; 


TAGATCTTAG 


(SEQ 


ID 


NO: 255) ; 


GATCTTAGCC 


(SEQ 


ID 


NO:257) ; 


TCTTAGCCAC 


(SEQ 


ID 


NO: 259) ; 


TTAGCCACTT 


(SEQ 


ID 


NO: 261) ; 


AGCCACTTTT 


(SEQ 


ID 


NO: 263) ; 


CCACTTTTTA 


(SEQ 


ID 


NO: 265) ; 
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CACTTTTTAA (SEQ ID NO:266) ; 
CTTTTTAAAA (SEQ ID NO:268); 

TTTTAAAAGA (SEQ ID NO:270) ; 

TTAAAAGAAA (SEQ ID NO:272) ; 

AAAAGAAAAG (SEQ ID NO:274); 

AAGAAAAGGG (SEQ ID NO:276) ; 

GAAAAGGGGG (SEQ ID NO:278) ; 

AAAGGGGGGA (SEQ ID NO: 280); 

AGGGGGGACT (SEQ ID NO:282) ; 

GGGGGACTGG (SEQ ID NO: 284) ; 

GGGACTGGAA (SEQ ID NO:286); 

GACTGGAAGG (SEQ ID NO:288); 

CTGGAAGGGC (SEQ ID NO:290) ; 

GGAAGGGCTA (SEQ ID NO:292); 
AAGGGCTAAT (SEQ ID NO: 294) ; 

GGGCTAATTC (SEQ ID NO:296); 

GCTAATTCAC (SEQ ID NO: 2 98) ; 
TAATTCACTC (SEQ ID NO:3Q0); 
ATTC3iCTCCC (SEQ ID NO:302) ; 
TCACTCCCAA (SEQ ID NO:304) ; 
ACTCCCAAAG (SEQ ID NO: 3 06) ; 
TCCCAAAGAA (SEQ ID NO: 308) ; 
CCAAAGAAGA (SEQ ID NO:310) ; 
AAAGAAGACA (SEQ ID NO:312) ; 
AGAAGAGAAG (SEQ ID NO;314) / 
AAGACAAGAT (SEQ ID NO: 316) ; 
GACAAGATAT (SEQ ID NO:318); 
CAAGATATCC (SEQ ID NO:320) ; 
AGATATCCTT (SEQ ID NO:322) ; 
ATATCCTTGA (SEQ ID NO:324); 
ATCCTTGATC (SEQ ID NO:326) ; 
CCTTGATCTG (SEQ ID NO:328) ; 
TTGATCTGTG (SEQ ID NO:330); 
GATCTGTGGA (SEQ ID NO: 332) ; 
TCTGTGGATC (SEQ ID NO:334); 
TGTGGATCTA (SEQ ID NO:336); 
TGGATCTACC (SEQ ID NO:338); 
OATCTACCAC (SEQ ID NO: 340) ; 
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ACTTTTTAAA (SEQ ID NO: 267) ; 
TTTTTAAAAG (SEQ ID NO:269); 
TTTAAAAGAA (SEQ ID NO:271) ; 
TAAAAGAAAA (SEQ ID NO:273); 
AAAGAAAAGG (SEQ ID NO:275) ; 
AGAAAAGGGG (SEQ ID NO:277) ; 
AAAAGGGGGG (SEQ ID NO:279); 
AAGGGGGGAC (SEQ ID NO:281) ; 
GGGGGGACTG (SEQ ID NO:283); 
GGGGACTGGA (SEQ ID NO:285); 
GGACTGGAAG (SEQ ID NO:287) ; 
ACTGGAAGGG (SEQ ID NO:289); 
TGGAAGGGCT (SEQ ID NO:291) ; 
GAAGGGCTAA (SEQ ID NO:293); 
AGGGCTAATT (SEQ ID NO: 295) ; 
GGCTAATTCA (SEQ ID NO:297) ; 
CTAATTCACT (SEQ ID NO:299) ; 
AATTCACTCC (SEQ ID NO:301); 
TTCACTCCCA (SEQ ID NO:303); 
CACTCCCAAA (SEQ ID NO: 3 05) ; 
CTCCCAAAGA (SEQ ID NO: 3 07) ; 
CCCAAAGAAG (SEQ ID NO:309) ; 
CAAAGAAGAC (SEQ ID NO: 311) ; 
AAGAAGACAA (SEQ ID NO:313); 
GAAGACAAGA (SEQ ID NO: 315) ; 
AGAGAAGATA (SEQ ID NO:317) ; 
ACAAGATATC (SEQ ID NO: 319) ; 
AAGATATCCT (SEQ ID NO:321); 
GATATCCTTG (SEQ ID NO:323); 
TATCCTTGAT (SEQ ID NO: 325) ; 
TCCTTGATCT (SEQ. ID NO; 327) ; 
CTTGATCTGT (SEQ ID NO:329) ; 
TCATCTGTGG (SEQ ID NO:33l); 
ATCTGTGGAT (SEQ ID NO: 333) ; 
CTGTGGATCT (SEQ ID NO:335) ; 
GTGGATCTAC (SEQ ID NO:337) ; 
GGATCTACGA (SEQ ID NO: 339) ; 
ATCTACCACA (SEQ ID NO:341); 
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TCTACCACAC 


(SEQ 


ID 


NO: 342) ; 


TACCACACAC 


(SEQ 


ID 


NO: 344) ; 


CCACACACAA 


(SEQ 


ID 


NO: 346) ; 


ACACACAAGG 


(SEQ 


ID 


NO: 348) ; 


ACACAAGOCT 


(SEQ 


ID 


NO: 350) ; 


ACAAGGCTAC 


(SEQ 


ID 


NO: 352) ; 


AAGGCTACTT 


(SEQ 


ID 


NO: 354) ; 


GGCTACTTCC 


(SEQ 


ID 


NO: 356} ; 


CTACTTCCCT 


(SEQ 


ID 


NO: 358} ; 


ACTTCCCTGA 


(SEQ 


ID 


NO:360) ; 


TTCCCTGATT 


(SEQ 


ID 


NO: 362) ; 


CCCTGATTGG 


(SEQ 


ID 


NO: 364) ; 


CTGATTGGCA 


(SEQ 


ID 


NO: 366) ; 


GATTGGCAGA 


(SEQ 


ID 


NO: 368) ; 


TTGGCAGAAC 


(SEQ 


ID 


NO:370) ; 


GGCAGAACTA 


(SEQ 


ID 


NO: 372} ; 


CAGAACTACA 


(SEQ 


ID 


NO: 374) ; 


GAACTACACA 


(SEQ 


ID 


NO: 376) ; 




(SEO 


ID 


NO* 378 ) : 


TAPAPAfY^Af* 
1 nl«ri WVw Wtur 




TT1 


liw • JO v / j 
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CACCAGGGCC 


( SEO 


ID 


NO* 384) ; 


CCAGGGCCAG 


(SEQ 


ID 


NO: 386) ; 


AGGGCCAGGG 


(SEQ 


ID 


NO:3B8) ; 


GOCCAGGGGT 


(SEQ 


ID 


NO:390) ; 


CCAGGGGTCA 


(SEQ 


ID 


NO: 392) ; 


AGGGGTCAGA 


(SEQ 


ID 


NO: 394} ; 


GGGTCAGATA 


(SEQ 


ID 


NO: 396) ; 


GTCAGATATC 


(SEQ 


ID 


NO: 398) ; 


CAGATATCCA 


(SEQ 


ID 


NO:400) ; 


GATATCCACT 


(SEQ 


ID 


NO:402) ; 


TATCCACTGA 


(SEO 


ID 


NO: 404) ; 


TCCACTGACC 


(SEQ 


ID 


NO:406) ; 


CACTGACCTT 


(SEQ 


ID 


NO: 408) ; 


CTGACCTTTG 


(SEQ 


ID 


NO: 410) ; 


GACCTTTGGA 


(SEQ 


ID 


NO: 412) ; 


CCTTTGGATG 


(SEQ 


ID 


NO: 4 14) ; 


TTTGGATGGT 


(SEQ 


ID 


NO: 416) ; 
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CTACCACACA 


(SEQ 


ID 


NO: 343) ; 


ACCACACACA 


(SEQ 


ID 


NO: 345) ; 


CACACACAAG 


(SEQ 


ID 


NO: 347) ; 


CACACAAGGC 


(SEQ 


ID 


NO: 349) ; 


CACAAGGCTA 


(SEQ 


ID 


NO:351) ; 


CAAGGCTACT 


(SEQ 


ID 


NO: 353) ; 


AGGCTACTTC 


(SEQ 


ID 


NO:355) ; 


GCTACTTCCC 


(SEQ 


ID 


NO:357) ; 


TACTTCCCTG 


(SEQ 


ID 


NO: 359) ; 


CTTCCCTGAT 


(SEQ 


ID 


NO:361) ; 


TCCCTGATTG 


(SEQ 


ID 


NO: 363) ; 


CCTGATTGGC 


(SEQ 


ID 


NO: 365); 


TGATTGGCAG 


(SEQ 


ID 


NO: 367) ; 


ATTGGCAGAA 


(SEQ 


ID 


NO:369) ; 


TGGCAGAACT 


(SEQ 


ID 


NO: 371) ; 


GCAGAACTAC 


(SEQ 


ID 


NO: 373) ; 


AGAACTACAC 


(SEQ 


ID 


NO: 375) ; 


AACTACACAC 


(SEQ 


ID 


NO: 377) ; 


CTACACACCA 


(SEQ 

\ ****** 


ID 


NO: 379) ; 


APAPACCAGG 


(SEO 


ID 


NO: 381) : 


APAPPAGGGC 


(SEO 


ID 


NO: 383) ; 


ACCAGGGCCA 


(SEQ 


ID 


NO:385) / 


CAGGGCCAGG 


(SEQ 


ID 


NO: 387) ; 


GGGCCAGGGG 


(SEQ 


ID 


NO:389) ; 


GCCAGGGGTC 


(SEQ 


ID 


NO:391) ; 


CAGGGGTCAG 


(SEQ 


ID 


NO:393) ; 


GGGGTCAGAT 


(SEQ 


ID 


NO: 395); 


GGTCAGATAT 


(SEQ 


ID 


NO: 397) ; 


TCAGATATCC 


(SEQ 


ID 


NO:399) ; 


AGATATCCAC 


(SEQ 


ID 


NO: 401) ; 


ATATCCACTG 


(SEQ 


ID 


NO: 4 03) ; 


ATCCACTGAC 


(SEQ 


ID 


NO:405) ; 


CCACTGACCT 


(SEQ 


ID 


NO:407) ; 


ACTGACCTTT 


(SEQ 


ID 


NO:409) ; 


TGACCTTTGG 


(SEQ 


ID 


NO: 411} ; 


ACCTTTGGAT 


(SEQ 


ID 


NO: 413) ; 


CTTTGGATGG 


(SEQ 


ID 


NO:415) ; 


TTGGATGGTG 


(SEQ 


ID 


NO:417) ; 
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TGGATGGTGC (SEQ ID NO: 418) ; 

• QATGGTGCTA (SEQ ID NO: 420); 
TGGTOCTACA (SEQ ID NO:422) ; 
GTGCTACAAG (SEQ ID NO: 424) ; 
GCTACAAGCT (SEQ ID NO: 42 6) ; 
TACAAGCTAG (SEQ ID" NO: 428); 
CAAGCTAGTA (SEQ ID NO:430) ; 

AGCTAGTACC (SEQ ID NO: 432) ; 

CTAQTACCAG (SEQ ID NO:434); 

AGTACCAGTT (SEQ ID NO: 436) ; 

TACCAGTTGA (SEQ ID NO: 438); 

CCAGTTGAGC (SEQ ID NO: 440) ; 

AGTTGAGCCA (SEQ ID NO: 442); 

TTGAGCCAGA (SEQ ID NO: 444) ; 

GAGCCAGATA (SEQ ID NO:446) ; 

GCCAGATAAG (SEQ ID NO: 448) 

CAGATAAGGT (SEQ ID NO: 450) ; 

GATAAGGTAG (SEQ ID NO: 452) ; 

TAAGGTAGAA (SEQ ID NO: 454) ; 

AGGTA6AAGA (SEQ ID NO: 456) ; 

GTAGAAGAGG (SEQ ID NO:458) ; 

AGAAGAGGCC (SEQ ID NO: 460) ; 

AAGAGGCCAA (SEQ ID NO: 4 62); 

GAGGCCAATA (SEQ ID NO: 464); 

GGCCAATAAA (SEQ ID NO: 466) ; 

CCAATAAAGG (SEQ ID NO: 468) ; 
AATAAAGGAG (SEQ ID NO:470) ; 
TAAAGGAGAG (SEQ ID NO: 472) ; 
AAGGAGAGAA (SEQ ID NO: 474) ; 
GGAGAGAACA (SEQ ID NO:476) ; 
AGAGAACACC (SEQ ID NO: 478) ; 
AGAACACCAG (SEQ ID NO: 480) ; 
AACACCAGCT (SEQ ID NO: 482) ; 
CACCASCTTG (SEQ ID NO: 484) ; 
CCAGCTTGTT (SEQ ID NO:486) ; 
AGCTTGTTAC (SEQ ID NO:488) ; 
CTTGTTACAC (SEQ ID NO: 490) ; 
TGTTACaCCC (SEQ ID NO: 492) ; 
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GGATGGTGCT (SEQ ID NO: 4 19) ; 
ATGGTGCTAC (SEQ ID NO: 421) ; 
GGTGCTACAA (SEQ ID NO: 423) ; 
TGCTACAAGC (SEQ ID NO:425) ; 
CTACAAGCTA (SEQ ID NO: 427) ; 
ACAAGCTAGT (SEQ ID NO: 429} ; 
AAGCTAGTAC (SEQ ID NO: 431) ; 
GCTAGTACCA (SEQ ID NO:433); 
TAGTACCAGT (SEQ ID NO: 435) ; 
GTACCAGTTG (SEQ ID NO:437); 
ACCAGTTGAG (SEQ ID NO: 439) ; 
CAGTTGAGCC (SEQ ID NO: 441); 
GTTGAGCCAG (SEQ ID NO: 443) ; 
TGAGCCAGAT (SEQ ID NO:445) ; 
AGCCAGATAA (SEQ ID NO: 447) ; 
CCAGATAAGG (SEQ ID NO: 449} ; 
AGATAAGGTA (SEQ ID NO: 451) ; 
ATAAGGTAGA (SEQ ID NO: 453) ; 
AAGGTAGAAG (SEQ ID N0:455); 
GGTAGAAGAG (SEQ ID NO: 457) ; 
TAGAAGAGGC (SEQ ID NO: 459) ; 
GAAGAGGCGA (SEQ ID NO:46l) ; 
AGAGGCCAAT (SEQ ID NO:463) ; 
AGGCCAATAA (SEQ ID NO: 465) ; 
GCCAATAAAG (SEQ ID NO:467) ; 
CAATAAAGGA (SEQ ID NO: 469) ; 
ATAAAGGAGA (SEQ ID NO: 471) ; 
AAAGGAGAGA (SEQ ID NO: 473} ; 
AGGAGAGAAC (SEQ ID NO: 475) ; 
GAGAGAACAC (SEQ ID NO:477) ; 
GAGAACACCA (SEQ ID NO:479) ; 
GAACACCAGC (SEQ ID NO:481) ; 
ACACCAGCTT {SEQ ID NO: 483) ; 
ACCAGCTTGT (SEQ ID NO: 485) ; 
CAGCTTGTTA (SEQ ID NO: 487) ; 
GCTTGTTACA (SEQ ID NO: 489) ; 
TTGTTACACC (SEQ ID NO: 491) ; 
GTTACACCCT (SEQ ID NO: 493) ; 
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TTACACCCTG 


(SEQ 


ID 


NO: 494 


ACACCCTGTG 


(SEQ 


ID 


NO: 496 


ACCCTQTGAG 


(SEQ 


ID 


NO: 498 


CCTGTGAGCC 


(SEQ 


ID 


NO: 500 


TGTGAGCCTG 


(SEQ 


ID 


NO: 502 


TGAGCCTGCA 


(SEQ 


ID 


NO: 504 


AGCCTGCATG 


(SEQ 


ID 


NO: 506 


CCTGCATGGA 


(SEQ 


ID 


NO: 508 


TGCATGGAAT 


(SEQ 


ID 


NO: 510 


CATGGAATGG 


(SEQ 


ID 


NO: 512 


TGGAATGGAT 


(SEQ 


ID 


NO: 514 


GAATGGATGA 


(SEQ 


ID 


NO: 516] 


ATGGATGACC 


(SEQ 


ID 


NO: 518] 


GOATGACCCT 


(SEQ 


ID 


NO: 520] 


ATGACCCTGA 


(SEQ 


ID 


NO: 522] 


GACCCTGAGA 


(SEQ 


ID 


NO: 524) 


CCCTGAGAGA 


(SEQ 


ID 


NO: 526) 


CTGAGAGAGA 


(SEQ 


ID 


NO: 528) 


GAGAGAGAAG 


(SEQ 


ID 


NO: 530) 


GAGAGAAGTG 


(SEQ 


ID 


NO: 532) 


GAGAAGTGTT 


(SEQ 


ID 


NO: 534) 


GAAGTGTTAG 


(SEQ 


ID 


NO: 536) 


AGTGTTAGAG 


(SEQ 


ID 


NO: 538) 


TGTTAGAGTG 


(SEQ 


ID 


NO: 540} 


TTAGAGTGGA 


(SEQ 


ID 


NO: 542) 


AGAGTGGAGG 


(SEQ 


ID 


NO: 544) 


AGTGGAGGTT 


(SEQ 


ID 


NO: 546) 


TGGAGGTTTG 


(SEQ 


ID 


NO: 548) 


GAGGTTT6AC 


(SEQ 


ID 


NO: 550) 


GGTTTGACAG 


(SEQ 


ID 


NO: 552) 


TTTGACAGCC 


(SEQ 


ID 


NO: 554) 


TGACAGCCGC 


(SEQ 


ID 


NO: 556) 


ACAGCCGCCT 


(SEQ 


ID 


NO: 558) 


AGCCOCCTAQ 


(SEQ 


ID 


NO: 560) 


CCGCCTAGCA 


(SEQ 


ID 


NO: 562) 


GCCTAGCATT 


(SEQ 


ID 


NO: 564) 


CTAGCATTTC 


(SEQ 


ID 


NO: 566) 


AGCATTTCAT 


(SEQ 


ID 


NO: 568) 



TACACCCTOT 


(SEQ 


ID 


NO:495) 


CACCCTGTGA 


(SEQ 


ID 


NO: 497) 


CCCTGTGAGC 


. (SEQ 


ID 


NO:499) 


CTGT6A6CCT 


(SEQ 


ID 


NO: 501) 


GTGAGCCTGC 


(SEQ 


ID 


NO: 503) 


GAGCCTGCAT 


(SEQ 


ID 


NO: 505) , 


GCCTGCATGG 


(SEQ 


ID 


NO: 507) , 


CTGCATGGAA 


(SEQ 


ID 


NO: 509) , 


GCATGGAAT6 


(SEQ 


ID 


NO: 511) , 


ATGGAATGGA 


(SEQ 


ID 


NO: 513) , 


GGAATGGATG 


(SEQ 


ID 


NO: 515) ; 


AATGGATGAC 


(SEQ 


ID 


NO: 517} ; 


TGGATGACCC 


(SEQ. 


ID 


NO:519) ; 


GATGACCCTG 


(SEQ 


ID 


NO: 521) ; 


TGACCCTGA6 


(SEQ 


ID 


NO: 523) ; 


ACCCTGAGAG 


(SEQ 


ID 


NO: 525) ; 


CCTGAGAGAG 


(SEQ 


ID 


NO: 527) ; 


TGAGAGAGAA 


(SEQ 


ID 


NO: 529) ; 


AGAGAGAAGT 


(SEQ 


ID 


NO: 531) ; 


AGAGAAGTGT 


(SEQ 


ID 


NO: 533) ; 


AGAAGTGTTA 


(SEQ 


ID 


NO: 535) ; 


AAGTGTTAGA 


(SEQ 


ID 


NO:537) ; 


GTGTTAGAGT 


(SEQ 


ID 


NO:539) ; 


GTTAGAGTGG 


(SEQ 


ID 


NO: 541) ; 


TAGAGTGGAG 


(SEQ 


ID 


NO: 543) ; 


GAGTGGAGGT 


(SEQ 


ID 


NO: 545) ; 


GTGGAGGTTT 


(SEQ 


ID 


NO: 547) ; 


GGAGGTTTGA 


(SEQ 


ID 


NO: 549) ; 


AGGTTTGACA 


(SEQ 


ID 


NO: 551) ; 


GTTTGACAGC 


(SEQ 


ID 


NO: 553) ; 


TTGACAGCCG 


(SEQ 


ID 


NO: 555) ; 


GACAGCOGCC 


(SEQ 


ID 


NO: 557) ; 


CAGCGGCCTA 


(SEQ 


ID 


NO: 559) ; 


GCCGCCTAGC 


(SEQ 


ID 


NO: 561) ; 


CGCCTAGCAT 


(SEQ 


ID 


NO: 563) ; 


CCTAGCATTT 


(SEQ 


ID 


NO: 565) ; 


TAGCATTTCA 


(SEQ 


ID 


NO: 567} ; 


GCATTTCATC 


(SEQ 


ID 


NO: 569) ; 
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CATTTCATCA 


(SEQ 


ID 


NO: 570) ; 


ATTTCATCAC 


(SEQ 


ID 


NO: 571) , 


TTTCATCACG 


(SEQ 


ID 


NO: 572) ; 


TTCATCACGT 


(SEQ 


ID 


NO:573) , 


TCATCACGTG 


(SEQ 


ID 


NO: 574) ; 


CATCACGTGG 


(SEQ 


ID 


NO: 575) , 


ATCACGTGGC 


(SEQ 


ID 


NO: 576) ; 


TCACGTGGCC 


(SEQ 


ID 


NO:577) , 


CACGTGGCCC 


(SEQ 


ID 


NO: 578); 


ACGTGGCCCG 


(SEQ 


ID 


NO:579) , 


CGTGGCCCGA 


(SEQ 


ID 


NO: 580) ; 


GT6GCCCGA6 


(SEQ 


ID 


NO: 581) ; 


TGGCCCGAGA 


(SEQ 


ID 


NO: 582) ; 


6GCCCGAGA6 


(SEQ 


ID 


NO: 583) ; 


GCCCGAGAGC 


(SEQ 


ID 


NO: 584) ; 


CCCGAGAGCT 


(SEQ 


ID 


NO: 585) ; 


CCGAGAGCTG 


(SEQ 


ID 


NO.-586) ; 


CGAGAGCTGC 


(SEQ 


ID 


NO: 587) ; 


GAGAGCTGCA 


(SEQ 


ID 


NO: 588) ; 


AGAGCTGCAT 


(SEQ 


ID 


NO: 589) ; 


GAGCTGCATC 


(SEQ 


ID 


NO:590); 


AGCTGCATCC 


(SEQ 


ID 


NO: 591) ; 


GCTGCATCCG 


(SEQ 


ID 


NO: 592) ; 


CTGCATCCGG 


(SEQ 


ID 


NO: 593) ; 


TGCATCCGGA 


(SEQ 


ID 


NO: 594) ; 


GCATCCGGAG 


(SEQ 


ID 


NO: 595) ; 


CATCOGGAGT 


(SEQ 


ID 


NO: 596); 


ATCCGGAGTA 


(SEQ 


ID 


NO:597) ; 


TCCGGAGTAC 


(SEQ 


ID 


NO:598) ; 


CCGGAGTACT 


(SEQ 


ID 


NO: 599) ; 


CGGAGTACTT 


(SEQ 


ID 


NO:600) ; 


GGAGTACTTC 


(SEQ 


ID 


NO: 601) ; 


GAGTACTTCA 


(SEQ 


ID 


NO: 602) ; 


AGTACTTCAA 


(SEQ 


ID 


NO: 603) ; 


GTACTTCAAG 


(SEQ 


ID 


NO: 604) ; 


TACTTCAAGA 


(SEQ 


ID 


NO: 605) ; 


ACTTCAAGAA 


(SEQ 


ID 


NO: 606) ; 


CTTCAAGAAC 


(SEQ 


ID 


NO: 607) ; 


TTCAAGAACT 


(SEQ 


ID 


NO: 608) ; 


TCAAGAACTG 


(SEQ 


ID 


NO: 609) ; 


CAAGAACTGC 


(SEQ 


ID 


NO: 610) ; 


AAGAACTGCT 


(SEQ 


ID 


NO: 611) ; 


AGAACTGCTG 


(SEQ 


ID 


NO: 612) ; 


GAACTGCTGA 


(SEQ 


ID 


NO:613) . 



85. A strain of HTV-1 according to claim 79 wherein said deletion encompasses at 
least one of the following decanncleotides from the LTR region of HTV-1 strain NL4-3: 



GCTTTTTGCC 


(SEQ 


ID 


NO: 652) ; 


CTTTTTGCCT 


(SEQ 


ID NO: 653) ; 


TTTTTGCCTG 


(SEQ 


ID 


NO: 654) ; 


TTTTGCCTGT 


(SEQ 


ID NO:655); 


TTTGCCTGTA 


(SEQ 


ID 


NO: 656) ; 


TTGCCTGTAC 


(SEQ 


ID NO: 657) ; 


TGCCTGTACT 


(SEQ 


ID 


NO:658) ; 


GCCTGTACTG 


(SEQ 


ID NO: 659) ; 


CCTGTACTGG 


(SEQ 


ID 


NO:660) ; 


CTGTACTGGG 


(SEQ 


ID NO: 661) ; 


TGTACTGGGT 


(SEQ 


ID 


NO: 6 62) ; 


GTACTGGGTC 


(SEQ 


ID NO:663) ; 


TACTGGGTCT 


(SEQ 


ID 


NO: 664) ; 


ACTGGGTCTC 


(SEQ 


ID NO:665) ; 


CTGGGTCTCT 


(SEQ 


ID 


NO: 666) ; 


TGGGTCTCTC 


(SEQ 


ID NO: 667) ; 


GGGTCTCTCT 


(SEQ 


ID 


NO: 668) ; 


GGTCTCTCTG 


(SEQ 


ID NO: 669) ; 


GTCTCTCTGG 


(SEQ 


ID 


NO: 670) ; 


TCTCTCTGGT 


(SEQ 


ID NO: 671) ; 


CTCTCTGGTT 


(SEQ 


ID 


NO:672) ; 


TCTCTGGTTA 


(SEQ 


ID NO:673); 


CTCTGGTTAG 


(SEQ 


ID 


NO: 674) ; 


TCTCTGGTTA 


(SEQ 


ID NO: 6 75) ; 
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CTGGTTAGAC 


(SEQ 


ID 


NO: 676) 


; TGGTTAGACC 


(SEQ 


ID 


NO: 677) , 


GGTTAGACCA 


(SEQ 


ID 


NO: 678) 


; GTTAGACCAG 


(SEQ 


ID 


NO: 679) , 


TTAGACCAQA 


(SEQ 


ID 


NO: 680) 


; TAGACCAGAT 


(SEQ 


ID 


NO: 681) , 


AGACCAGATC 


(SEQ 


ID 


NO: 682) 


; GACCAGATCT 


(SEQ 


ID 


NO: 683) ; 


ACCAOATCTG 


(SEQ 


ID 


NO: 684) 


; CCAGATCTGA 


(SEQ 


ID 


NO:685) ; 


CAGATCTGAG 


(SEQ 


ID 


NO: 666) 


; AGATCTGAGC 


(SEQ 


ID 


NO: 687) ; 


GATCTGAGCC 


(SEQ 


ID 


NO: 688) 


; ATCTGAGCCT 


(SEQ 


ID 


NO: 689} ; 


TCTGAGCCTG 


(SEQ 


ID 


NO:690) 


i CTGAGCCTGG 


(SEQ 


ID 


NO: 691) ; 


TGAGCCTGGG 


(SEQ 


ID 


NO: 6 92) 


; GAGCCTGGGA 


(SEQ 


ID 


NO: 693) ; 


AGCCTGGGAG 


(SEQ 


ID 


NO: 694) 


; GCCTGGGAGC 


(SEQ 


ID 


NO: 695) ; 


CCTGGGAGCT 


(SEQ 


ID 


NO: 696) , 


; CTGGGAGCTC 


(SEQ 


ID 


NO: 697) ; 


TGGGAGCTCT 


(SEQ 


ID 


NO:698) , 


; G06AGCTCTC 


(SEQ 


ID 


NO: 699) ? 


GGAGCTCTCT 


(SEQ 


ID NO: 700) ; 


? GAGCTCTCTG 


{SEQ 


ID 


NO: 701) ; 


AGCTCTCTGG 


(SEQ 


ID 


NO: 702) ; 


■ 6CTCTCT66C 


(SEQ 


ID 


NO: 703) ; 


CTCTCTGGCT 


(SEQ 


ID NO s 704) i 


TCTCTGGCTA 


(SEQ 


ID 


NO: 705) ; 


CTCTGGCTAA 


(SEQ 


ID NO:706) , 


■ TCTGGCTAAC 


(SEQ 


ID 


NO: 707) ; 


CTGGCTAACT 


(SEQ 


ID NO: 708); TGGCTAACTA 


(SEQ 


ID 


NO: 709) ; 


GGCTAACTAG 


(SEQ 


ID NO: 710} ; GCTAACTAGG 


(SEQ 


ID 


NO: 711) ; 


CTAACTAGGG 


(SEQ 


ID 


NO : 712 ) ; TAACTAGGGA 


(SEQ 


ID 


NO: 713) ; 


AACTAGGGAA 


(SEO 


ID 


NO: 714) ; ACTAGGGAAC 


(SEQ 


ID 


NO: 715) ; 


CTAGGGAACC 


(SEO 


ID 


NO: 716) ; TAGGGAACCC 


(SEQ 


ID 


NO: 717) ; 


AGGGAACCCA 


(SEQ 


ID 


NO: 718) ; 


GGGAACCCAC 


(SEQ 


ID 


NO: 719) ; 


GGAACCCACT 


(SEQ 


ID NO: 720) ; GAACCCACTG 


(SEQ 


ID 


NO: 721) ; 


AACCCACTGC 


(SEQ 


ID NO: 722) ; ACCCACTOCT 


(SEQ 


ID 


NO: 723) ; 


CCCACTGCTT 


(SEQ 


ID 


NO : 724 ) ; CCACTGCTTA 


(SEQ 


ID 


NO: 725) ; 


CACTGCTTAA 


(SEQ 


ID 


NO: 726) ; 


ACTGCTTAAG 


(SEQ 


ID 


NO: 727) } 


CTGCTTAAGC 


(SEQ 


ID 


NO: 728) ; 


TGCTTAAGCC 


(SEQ 


ID 


NO: 729) ; 


GCTTAAGCCT 


(SEQ 


ID 


NO: 730) ; 


CTTAAGCCTC 


(SEQ 


ID 


NO:731) ; 


TTAAGCCTCA 


(SEQ 


ID 


NO: 732) ; 


TAAGCCTCAA 


(SEQ 


ID 


NO: 733) ; 


AAGCCTCAAT 


(SEQ 


ID 


NO: 734) ; 


AGCCTCAATA 


(SEQ 


ID 


NO:735) ; 


GCCTCAATAA 


(SEQ 


ID 


NO: 736) ; 


CCTCAATAAA 


(SEQ 


ID 


NO: 737) ; 


CTCAATAAAG 


(SEQ 


ID NO:738) ; 


TCAATAAAGC 


(SEQ 


ID 


NO: 739) / 


CAATAAAGCT 


(SEQ 


ID 


NO: 740) ; 


AATAAAGCTT 


(SEQ 


ID 


NO: 741) ; 


ATAAAGCTTG 


(SEQ 


ID 


NO: 742) ; 


TAAAGCTTGC 


(SEQ 


ID 


NO: 743) ; 


AAAGCTTGCC 


(SEQ 


ID NO: 744) ; 


AAGCTTGCCT 


(SEQ 


ID 


NO: 745) ; 


AGCTTGCCTT 


(SEQ 


ID 


NO: 746) j GCTTGCCTTG 


(SEQ 


ID 


NO: 747) ; 


CTTGCCTTGA 


(SEQ 


ID 


NO: 748) ; TTGCCTTGAG 


(SEQ 


ID 


NO: 749) ; 


TGCCTTGAGT 


(SEQ 


ID 


NO : 750 ) ; GCCTTGAGTG 


(SEQ 


ID 


NO: 751) ; 
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CCTTGAGTGC 

TTGAGTGCTT 

GA6T6CTTCA 

GTGCTTCAAG 

GCTTCAAGTA 

TTCAAGTAGT 

CAAGTAGTGT 

AGTAGTGTGT 

TA6TGTGTGC 

6TGTGTGCCC 

GTGTGCCCGT 

GTGCCCGTCT 

GCCCGTCTGT 

CCGTCTGTTG 

GTCTGTTGTG 

CTGTTGTGTG 

GTTGTGTGAC 

TGTGTGACTC 

TGTGTGACTC 

TGTGACTCTG 

TGACTCTGGT 

ACTCTGGTAA 

TCTGGTAACT 

TGGTAACTAG 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 752 
NO: 754 
NO: 756 
NO: 758 
NO: 760 
NO: 762 
NO: 764 
NO: 766 
NO: 768 
NO: 770 
NO : 772 
NO: 774 
NO: 776 
NO j 778 
NO: 780 
NO: 782 
NO: 784 
NO: 786 
NO: 788 
NO: 790 
NO: 792 
NO: 794 
NO: 796 
NO: 79 8 



CTTGAGTGCT 

TGAGTGCTTC 

AGTGCTTCAA 

TGCTTCAAGT 

CTTCAAGTAG 

TCAAGTAGTG 

AAGTAGTGTG 

GTAGTGTGTG 

AGTGTGTGCC 

TGTGTGCCCG 

TGTGCCOGTC 

TGCCCGTCTG 

CCCGTCTGTT 

CGTCTGTTGT 

TCTGTTGTGT 

TGTTGTGTGA 

TTGTGTGACT 

GTGTGACTCT 

GTGTGACTCT 

GTGACTCTGG 

GACTCTGGTA 

CTCTGGTAAC 

CTGGTAACTA 

GGTAACTAGA 



(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 
(SEQ ID 



NO: 753) 
NO:7S5) 
NO: 75 7) 
NO: 759) 
NO: 761) 
NO: 763) 
NO: 765) 
NO: 767) 
NO: 769) 
NO: 771) 
NO: 773) 
NO: 775) 
NO: 777) 
NO: 779) ; 
NO: 781} 
NO: 783) 
NO:785) 
NO: 7 87) 
NO: 789) 
NO: 791) 
NO: 793) 
NO: 795) 
NO: 797) 
NO:799) . 



86. A method for determining the pathogenicity of an HTV-1 strain after said HTV-1 
strain infects cells of an individual, said method comprising determining the presence 
of a deletion mutation in its genome of at least 10 nucleotides within the region from 
nucleotide 8787 to nucleotide 9709 using the nucleotide numbering of HTV-1 strain 
NL4-3 wherein the presence of said deletion mutation is indicative of the presence of 
a non-pathogenic strain of HTV-1. 



87. A method according to claim 86 wherein said non-pathogenic HTV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 
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nucleotide (i) 8830-8862; 

(ii) 9009-9035; 

(iii) 9019-9029; and 

(iv) 9033-9049. 



88. A method according to claim 86 wherein said non-pathogenic HTV-1 carries a 
deletion in its genome of at least 1 0 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (v) 9281-9371; 

(vi) 9281-9362; 

(vii) 9105-9224; and 
(viu) 9271-9370. 

89. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide Qx) 8882-8928; 

(x) 8850-9006; 

(xi) 8792-9041; and 
(xii") 9112-9204. 

90. A method according to claim 86 wherein said non-pathogenic HIV-1 carries a 
deletion in its genome of at least 10 nucleotides from within a region selected from the 
list consisting of: 

nucleotide (xui) 9105-9224; 

(xiv) 9389-9395; and 

(xv) 9281-9366. 

91. A strain of HIV-1 according to claim 86 wherein said deletion encompasses at 
least one of the following decanucleotides from the nef gene of HIV-1 strain NL4-3: 
ATGGGTGGCA (SEQ ID NO:2) ; TGGGTGGCAA (SEQ ID NO:3) ; 
GGGTGGCAAG (SEQ ID NO: 4) ; GGTGQCAAGT . (SEQ ID NO:5) ; 
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GTGGCAAGTG (SEQ ID NO: 6) ; 
GGCAAGTGGT (SEQ ID NO:8); 
CAAGTGGTCA (SEQ ID NO: 10} ; 
AGTGGTCAAA (SEQ ID NO:12); 
TGGTCAAAAA (SEQ ID N0:14) ; 
GTGAAAAAGT (SEQ ID NO:16) ; 
CAAAAAGTAG (SEQ ID NO: 18) ; 

AAAAGTAGTG (SEQ ID NO: 20); 

AAGTAGTGTG (SEQ ID NO: 22); 

GTAGTGTGAT (SEQ ID NO: 24) / 

AGTGTGATTG (SEQ ID NO: 26) ; 

TGTGATTGGA (SEQ ID NO:28); 

TGATTGGATG (SEQ ID NO:30); 

ATTGGATGGC (SEQ ID NO: 32) ; 

TGGATGGCCT (SEQ ID NO:34) ; 

GATGGCCTGC (SEQ ID NO:36); 

TGGCCTGCTG (SEQ ID NO:38) ; 

GCCTGCTGTA (SEQ ID NO: 40) ; 

CTGCTGTAAG (SEQ ID NO: 42); 

GCTGTAAGGG (SEQ ID N0:44); 

TGTAAGGGAA (SEQ ID NO: 46) ; 

TAAGGGAAAG (SEQ ID NO:48) ; 
AGGGAAAGAA (SEQ ID NO:S0); 
GGAAAGAATG (SEQ ID NO:52); 
AAAGAATGAG (SEQ ID NO:54); 
AGAATGAGAC (SEQ ID NO:56); 
AATGAGACGA (SEQ ID NO:58); 
TGAGACGAGC (SEQ ID NO:60); 
AGACGAGCTG (SEQ ID NO: 62); 
ACGAGCTGAG (SEQ ID NO:64); 
GAGCTGAGCC (SEQ ID NO: 66); 
GCTGAGCCAG (SEQ ID NO:68) ; 
TGAGCCAGCA (SEQ ID NO:70); 
AGCCAGCAGC (SEQ ID NO:72) ; 
CCAGCAGCAG (SEQ ID NO: 74); 
AGCAGCAGAT (SEQ ID NO: 76) ; 
CAGCAGATGG (SEQ ID NO: 78) ; 
GCAGATGGGG (SEQ ID NO: 60) ; 



TGGCAAGTGG (SEQ ID NO: 7); 



GCAAGTGGTC 


(SEQ ID NO: 9) ; 


AAGTGGTCAA 


(SEQ 


ID 


NO: 11) ; 


^ GTGGTCAAAA 


(SEQ 


ID 


NO:13) ; 


GGTCAAAAAG 


(SEQ 


ID 


NO: 15) ; 


TCAAAAAGTA 


(SEQ 


ID 


NO: 17) ; 


AAAAAGTAGT 


(SEQ ID HO: 19) ; 


AAAGTAGTGT 


(SEQ 


ID 


NO:21) ; 


AGTAGTGTGA 


(SEQ 


ID 


NO:23) ; 


TAGTGTGATT 


(SEQ ID NO: 25) ; 


GTGTGATTGG 


(SEQ 


ID 


NO: 27) ; 


GTGATTGGAT 


(SEQ 


ID 


NO: 2 9) ; 


GATTGGATGG 


(SEQ 


ID 


NO:31) ; 


TTGGATGGCC 


(SEQ 


ID 


NO:33); 


GGATGGCCTG 


(SEQ 


ID 


NO:35); 


ATG6CCT6CT 


(SEQ 


ID 


NO:37); 


6GCCTGCTGT 


(SEQ 


ID 


NO: 39) ; 


CCTGCTGTAA 


(SEQ 


ID 


NO: 41) ; 


TOCTGTAAGG 


(SEQ 


ID 


NO:43) ; 


CTGTAAGGGA 


(SEQ 


ID 


NO: 45) ; 


GTAAGGGAAA 


(SEQ 


ID . 


NO: 47) ; 


AAGGGAAAGA 


(SEQ 


ID 


N0:49) ; 


GGGAAAGAAT 


(SEQ 


ID 


NO: 51); 


GAAAGAATGA 


(SEQ 


ID 


NO: 53) ; 


AAGAATGAGA 


(SEQ 


ID 


NO:55) ; 


GAATGAGACG 


(SEQ 


ID 


NO: 57) ; 


ATGAGACGAG 


(SEQ 


ID 


NO: 59) ; 


GAGACGAGCT 


(SEQ 


ID 


NO: 61) ; 


GACGAGCTGA 


(SEQ 


ID 


NO: S3) ; 


CGAGCTGAGC 


(SEQ 


ID 


NO: 65) ; 


AGCTGAGCCA 


(SEQ 


ID 


NO: 67) ; 


CTGAGCCAGC 


(SEQ 


ID 


NO: 69) ; 


GAGCCAGCAG 


(SEQ 


ID 


NO: 71) ; 


GCCAGCAGCA 


(SEQ 


ID 


NO: 73) ; 


CAGCAGCAGA 


(SEQ 


ID 


NO:75); 


GCAGCAGATG 


(SEQ 


ID 


NO:77); 


AGCAGATGGG 


(SEQ 


ID 


NO: 79) ; 


CAGATGGGGT 


(SEQ 


ID 


NO: 81) ; 



WO 95/21912 



PCT/AU95/00063 



241 



AGATGGGGTG (SEQ 
ATGGGGTGGG (SEQ 
GGGGTGGGAG (SEQ 
GGTGGGAGCA (SEQ 
TGGGAGCAGT (SEQ 
GGAGCAGTAT (SEQ 
AGCAGTATCT (SEQ 
CAGTATCTCG (SEQ 
GTATCTCGAG (SEQ 
ATCTCGAGAC (SEQ 
CTCGAGACCT (SEQ 
CGAGACCTAG (SEQ 
AGACCTAGAA (SEQ 
ACCTAGAAAA (SEQ 
CTAGAAAAAC (SEQ 
AGAAAAACAT (SEQ 
AAAAACATGG (SEQ 
AAACATGGAG (SEQ 
ACATGGAGCA (SEQ 
ATGGAGCAAT (SEQ 
GGAGCAATCA (SEQ 
AGCAATCACA (SEQ 
CAATCACAAG (SEQ 
ATCACAAGTA (SEQ 
GACAAGTAGC (SEQ 
CAAGTAGCAA (SEQ 
AGTAGCAATA (SEQ 
TAGCAATACA (SEQ 
GCAATACAGC (SEQ 
AATACAGCAG (SEQ 
TACAGCAGCT (SEQ 
CAGCAGCTAA (SEQ 
GCAGCTAACA (SEQ 
AGCTAACAAT (SEQ 
CTAACAATGC (SEQ 
AACAATGCTG (SEQ 
CAATGCTGCT (SEQ 
AT6CTGCTTG (SEQ 



ID NO: 82) ; 
ID NO: 84); 
ID NO: 86) ; 
ID NO: 88) ; 
ID NO: 90) ; 
ID NO: 92) ; 
ID NO: 94) ; 
ID NO: 96) ; 
ID NO: 98) ; 
ID NO: 100) ; 
ID NO: 102) ; 
ID NO: 104); 
ID NO: 106) ; 
ID NO: 108) ; 
ID NO: 110) 
ID NO: 112) 
ID NO: 114) 
ID NO: 116} 
ID NO: 118) 
ID NO: 120} 
ID NO: 122) 
ID NO: 124) 
ID NO: 126) 
ID NO: 128) 
ID NO: 130) 
ID NO: 132); 
ID NO: 134) 
ID NO: 13 6) 
ID NO: 13 8) 
ID NO: 140) ; 
ID NO: 142) 
ID NO: 144) 
ID NO -.146) 
ID NO: 148) 
ID NO: 150) 
ID NO: 152) 
ID NO: 154) 
ID NO: 156) 



GATGGGGTGG 
TGGGGTGGGA 
GOGTGGGAGC 
GTGGGAGCAG 
GGGAGCAGTA 
GAGCAGTATC 
GCAGTATCTC 
AGTATCTCGA 
TATCTCGAGA 
TCTCGAGACC 
TCGAGACCTA 
GAGACCTAGA 
GACCTAGAAA 
CCTAGAAAAA 
TAGAAAAACA 
GAAAAACATG 
AAAACATGGA 
AACATGGAGC 
CATGGAGCAA 
TGGAGCAATC 
GAGCAATCAC 
GCAATCACAA 
AATCACAAGT 
TCACAAGTAG 
ACAAGTAGCA 
AAGTAGCAAT 
GTAGCAATAC 
AGCAATACAG 
CAATACAGCA 
ATACAGCAGC 
ACAGCAGCTA 
AGCAGCTAAC 
CAGCTAACAA 
GCTAACAATG 
TAACAATGCT 
ACAATGCTGC 
AATGCTGCTT 
TGCTGCTTGT 



(SEQ ID NO:83) 
(SEQ ID NO: 85) 
(SEQ ID NO: 87) 
(SEQ ID NO: 89) 
(SEQ ID NO: 91) 
(SEQ ID NO: 93) 
(SEQ ID NO: 95) 
(SEQ ID NO: 97) 
(SEQ ID NO: 99) 
(SEQ ID NO: 101) 
(SEQ ID NO: 103); 
(SEQ ID NO: 105) ; 
(SEQ ID NO: 107) ; 
(SEQ ID NO: 109) ; 
(SEQ ID NO: 111) ; 
(SEQ ID NO: 113) ; 
(SEQ ID NO: 115); 
(SEQ ID NO: 117) ; 
(SEQ ID NO: 119) 
(SEQ ID NO: 121) 
(SEQ ID NO: 123) 
(SEQ ID NO: 125} 
(SEQ ID NO: 127) 
(SEQ ID NO: 129) 
(SEQ ID NO: 131) 
(SEQ ID NO: 133) 
(SEQ ID NO: 135} 
(SEQ ID NO: 137); 
(SEQ ID NO: 139) ? 
(SEQ ID NO: 141) ; 
(SEQ ID NO: 143) 
(SEQ ID NO: 145) 
(SEQ ID NO: 147) 
(SEQ ID NO: 149) 
(SEQ ID NO: 151) 
(SEQ ID NO: 153} 
(SEQ ID NO: 155) ; 
(SEQ ID NO: 157) ; 
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GCTGCTTGTG (SEQ ID NO: 158) 
TGCTTGTGCC (SEQ ID NO: 160) 
CTTGTGCCTG (SEQ ID NO: 162) 
TGTGCCTGGC (SEQ ID NO: 164) 
TGCCTGGCTA (SEQ ID NO: 166) 
CCTGGCTAGA (SEQ ID NO: 168) 
TGGCTAGAAG (SEQ ID NO: 170) 
GCTAGAAGCA (SEQ ID NO: 172) 
TAGAAGCACA (SEQ ID NO: 174) 
GAAGCACAAG (SEQ ID NO: 176) 
AGCACAAGAG (SEQ ID NO: 178) 
CACAAGAGGA (SEQ ID NO: 180) 
CAAGAGGAGG (SEQ ID NO: 182) 
AGAGGAGGAA (SEQ ID NO: 184) 
AGGAGGAAGA (SEQ ID NO: 186) 
GAGGAAGAGG (SEQ ID NO: 188) 
GGAAGAGGTG (SEQ ID NO: 190) 
AAGAGGTGGG (SEQ ID NO: 192) 
GAGGTGGGTT (SEQ ID NO: 194) 
GGTGGGTTTT (SEQ ID NO:196), 
TGGGTTTTCC (SEQ ID NO: 198} , 
GGTTTTCCAG (SEQ ID NO.-200) , 
TTTTCCAGTC (SEQ ID NO: 202) , 
TTCCAGTCAC (SEQ ID NO:204) , 
CCAGTCACAC (SEQ ID NO: 206) ; 
AGTCACACCT (SEQ ID NO: 208) ; 
TCACACCTCA (SEQ ID NO: 210) ; 
ACACCTCAGG (SEQ ID NO: 212); 
ACCTCAGGTA (SEQ ID NO: 214) ; 
CTCAGGTACC (SEQ ID NO: 216) ; 
CAGGTACCTT (SEQ ID NO: 218) ; 
GGTACCTTTA (SEQ ID NO: 220) j 
TACCTTTAAG (SEQ ID NO: 222) ; 
CCTTTAAGAC (SEQ ID NO: 224) ; 
TTTAAGACCA (SEQ ID NO: 226) ; 
TAAGACCAAT (SEQ ID NO: 228) j 
AGACCAATGA (SEQ ID NO:230) ; 
ACCAATGACT (SEQ ID NO: 232) ; 
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CTGCTTGTGC 


(SEQ 


ID 


NO: 159) 


GCTTGTGCCT 


(SEQ 


ID 


NO: 161) 


TTGTGCCTGG 


(SEQ 


ID 


NO: 163) 


GTGCCTGGCT 


(SEQ 


ID 


NO: 165) 


GCCTGGCTAG 


(SEQ 


ID 


NO: 167) 


CTGGCTAGAA 


(SEQ 


ID 


NO: 169) 


GGCTAGAA6C 


{SEQ 


ID 


NO: 171) 


CTAGAAGCAC 


(SEQ 


ID 


NO: 173) 


AGAAGCACAA 


(SEQ 


ID 


NO: 175) 


AAGCACAAGA 


(SEQ 


ID 


NO: 177) , 


GCACAAGAGG 


(SEQ 


ID 


NO: 179) , 


ACAAGAGGAG 


(SEQ 


ID 


NO: 181) ; 


AAGAGGAGGA 


(SEQ 


ID 


NO: 183) ; 


GAGGAGGAAG 


(SEQ 


ID 


NO: 185) j 


GGAGGAAGAG 


(SEQ 


ID 


NO: 187) ; 


AGGAAGAGGT 


(SEQ 


ID 


NO: 189) ; 


GAAGAGGTGG 


(SEQ 


ID 


NO: 191) ; 


AGAGGTGGGT 


(SEQ 


ID 


NO: 193) ; 


AGGTGGGTTT 


(SEQ 


ID 


NO: 195) ; 


GTGGGTTTTC 


(SEQ 


ID 


NO: 197) ; 


GGGTTTTCCA 


(SEQ 


ID 


NO: 199) ; 


GTTTTCCAGT 


(SEQ 


ID 


NO: 201) ; 


TTTCCAGTCA 


(SEQ 


ID 


NO: 203) ; 


TCCAGTCAGA 


(SEQ 


ID 


NO:205) ; 


CAGTCACACC 


(SEQ 


ID 


NO: 207) ; 


GTCACACCTC 


(SEQ 


ID 


NO: 209) ; 


CACACCTCAG 


(SEQ 


ID 


NO: 211) ; 


CACCTCAGGT 


(SEQ 


ID 


NO: 213) ; 


CCTCAGGTAC 


(SEQ 


ID 


NO: 215) ; 


TCAGGTACCT 


(SEQ 


ID 


NO: 217) ; 


AGGTACCTTT 


(SEQ 


ID 


NO: 219) ; 


GTACCTTTAA 


(SEQ 


ID 


NO:221) ; 


ACCTTTAAGA 


(SEQ 


ID 


NO: 223) ; 


CTTTAAGACC 


(SEQ 


ID 


NO:225) ; 


TTAAGACCAA 


(SEQ 


ID 


NO: 227) ; 


AAGACCAATG 


(SEQ 


ID 


NO: 229) ; 


GACCAATGAC 


(SEQ 


ID 


NO: 231) ; 


CCAATGACTT 


(SEQ 


ID 


NO: 233) ; 
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CAATGACTTA (SEQ ID NO: 234) ; 
ATGACTTACA ( SEQ ID NO : 23 6 ) ; 
GACTTACAAG (SEQ ID NO: 238) ; 
CTTACAAGGC (SEQ ID NO:240); 
TACAAGGCAG (SEQ ID NO: 242) ; 
CAAGGCAGCT (SEQ ID N0:244>; 
AGGCAGCTGT (SEQ ID NO: 246) ; 
6CA6CTGTAQ (SEQ ID NO: 248) ; 
AGCTGTAGAT (SEQ ID NO: 250); 
CTGTAGATCT (SEQ ID NO: 2 52); 
GTAGATCTTA (SEQ ID NO: 2 54); 
AGATCTTAGC (SEQ ID NO:256) ; 
ATCTTAGCCA (SEQ ID NO: 258) ; 
CTTAGCCACT (SEQ ID NO: 260); 
TAGCCACTTT (SEQ ID NO:262); 
GCCACTTTTT (SEQ ID NO:264) ; 
CACTTTTTAA (SEQ ID NO:266) ; 
CTTTTTAAAA (SEQ ID NO: 268); 
TTTTAAAAGA (SEQ ID NO: 270); 
TTAAAAQAAA (SEQ ID NO:272) ; 
AAAAGAAAAG (SEQ ID NO: 274) ; 
AAGAAAAGGG (SEQ ID NO:276); 
GAAAAGGGGG (SEQ ID NO: 278) ; 
AAAGGGGGGA (SEQ ID NO -.280) ; 
AGGGGGGACT (SEQ ID N0:282) ; 
GGGGGACTGG (SEQ ID N0:284) ; 
GGGACTGGAA (SEQ ID NO:286) ; 
GACTGGAAGG (SEQ ID N0:28B) ; 
CTGGAAGGGC (SEQ ID NO: 290) ; 
GGAAGGGCTA (SEQ ID NO: 292) ; 
AAGGGCTAAT (SEQ ID NO: 294) ; 
GGGCTAATTC (SEQ ID NO: 296) ; 
GCTAATTCAC (SEQ ID NO: 298) ; 
TAATTCACTC (SEQ ID NO: 300) ; 
ATTCACTCCC (SEQ ID NO: 302) ; 
TCACTCCCAA (SEQ ID NO: 304) ; 
ACTCCCAAAG (SEQ ID NO: 306) 7 
TCCCAAAGAA (SEQ ID NO:308) ; 



-243- 



AATGACTTAC 


(SEQ 


ID 


NO: 235 


TGACTTACAA 


(SEQ 


ID 


NO: 237 


ACTTACAAGG 


(SEQ 


ID 


N0:239 


TTACAAG6CA 


(SEQ 


ID 


NO: 241 


ACAA66CAGC 


(SEQ 


ID 


NO: 243 


AAGGCAGCTG 


(SEQ 


ID 


NO: 245 


GGCAGCTGTA 


(SEQ 


ID 


NO: 247 


CAGCTGTAGA 


(SEQ 


ID 


NO: 249 


GCTGTAGATC 


(SEQ 


ID 


NO: 251, 


TGTAGATCTT 


(SEQ 


ID 


NO: 2 53, 


TAGATCTTAG 


(SEQ 


ID 


NO:255] 


GATCTTAGCC 


(SEQ 


ID 


NO: 257; 


TCTTAGCCAC 

X w X X<^VA7WW4n>W 


(SEO 


ID 


NO -259 


TTAGCCACTFT 

X Xf%W W WkW X X 


(SEO 


ID 

X \f 


NO - 2fil 1 


AGCCACTTTT 

£*\J\^\^fa.\~ X X X X 


(SEO 


ID 

XX/ 




CCACTTTTTA 

A X X X x^% 


(SEO 


ID 


NO-2651 


ACTTTTTAAA 

mTmJm** mm, mm mm mm mm mrnmmmm 


(SEQ 


ID 


NO:267] 


TTTTTAAAAG 

« mm mm mm *w* mrmm Wlw 


(SEQ 


ID 


NO;269) 


TTTAAAAGAA 


(SEO 


ID 




TAAAAGAAAA 


(SEO 


ID 

xxy 


NO* 2731 


AAAGAAAAGG 


(SEO 

\0Xj^ 


ID 

XX/ 


NO -2751 


AGAAAAGGGG 


(SEO 


TD 

XX/ 


NO- 2771 


AAAAGGGGGG 

* 


(SEO 


ID 


NO •2791 


AAGGGGGGAC 


(SEO 


ID 


NO -2811 


GGGGGGACTG 


(SEQ 


ID 


NO:283) 


GGGGACTGGA 


(SEQ 


ID 


NO:28S) 


GGACTGGAAG 


(SEQ 


ID 


NO:287) 


ACTGGAAGGG 


(SEQ 


ID 


NO:289) 


TGGAAGGGCT 


(SEQ 


ID 


NO:291) 


GAAGGGCTAA 


(SEQ 


ID 


N0:293) 


AGGGCTAATT 


(SEO 


ID 

XX/ 


NO-29S1 


GGCTAATTCA 


(SEQ 


ID 


NO:297) 


CTAATTCACT 


(SEQ 


ID 


NO: 29 9) 


AATTCACTCC 


(SEQ 


ID 


NO: 301) 


TTCACTCCCA 


(SEQ 


ID 


NO:303) 


CACTCCCAAA 


(SEQ 


ID 


N0:305) 


CTCCCAAAGA 


(SEQ 


ID 


NO:307) 


CCCAAAGAAG 


(SEQ 


ID 


N0:309) 
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CCAAAGAAGA (SEQ ID NO:310) ; 

AAAGAAGACA (SEQ ID NO: 312) ; 

AGAAGACAAG (SEQ ID NO: 314) ; 

AAGACAAQAT (SEQ ID NO: 316) ; 

GACAAQATAT (SEQ ID NO: 318) ; 

CAAGATATCC (SEQ ID NO:320) ; 

AGATATCCTT (SEQ ID NO: 322) ; 

ATATCCTTGA (SEQ ID NO: 324) ; 

ATCCTTGATC (SEQ ID NO: 326) ; 

CCTTGATCTO (SEQ ID NO:328) ; 

TTGATCTGTG (SEQ ID NO:330); 

GATCTGTGGA (SEQ ID NO: 332) ; 

TCTGTGGATC (SEQ ID NO: 334) ; 

TGTGGATCTA (SEQ ID NO: 336) ; 

TGGATCTACC (SEQ ID NO: 338) ; 

GATCTACCAC (SEQ ID NO: 340) ; 

TCTACCACAC (SEQ ID NO: 342) ; 

TACCACACAC (SEQ ID NO: 344) ; 

CCACACACAA (SEQ ID NO: 346); 
ACACACAAG6 (SEQ ID NO: 348) ; 
ACACAAGGCT (SEQ ID NO: 350) ; 
ACAAGGCTAC (SEQ ID NO: 3 52) ; 
AAGGCTACTT (SEQ ID NO:354); 
GGCTACTTCC {SEQ ID NO: 356) ; 
CTACTTCCCT (SEQ ID NO: 3 58) ; 
ACTTCCCTGA (SEQ ID NO: 3 60) ; 
TTCCCTGATT (SEQ ID NO: 3 62); 
CCCTGATTGG (SEQ ID NO: 3 64) ; 
CTGATTGGCA (SEQ ID NO:366) ; 
GATTGGCAGA (SEQ ID NO: 368) ; 
TTGGCAGAAC (SEQ ID NO:370); 
G6CAGAACTA (SEQ ID NO: 372) ; 
CAGAACTACA (SEQ ID NO: 374) ; 
GAACTACACA (SEQ ID NO: 376) ; 
ACTACACACC (SEQ ID NO: 378) ; 
TACACACCAG (SEQ ID NO: 380); 
CACACCAGGO (SEQ ID NO: 382); 
CACCAGG6CC (SEQ ID NO:384) ; 



- 244 - 

CAAAGAAGAC (SEQ ID NO: 311) 

AAGAAGACAA (SEQ ID NO: 313) 

GAAGACAAGA (SEQ ID NO: 315) 

AGACAAGATA (SEQ ID RD:317) 

ACAAGATATC (SEQ ID NO:319) ; 

AAGATATCCT (SEQ ID NO: 321) ; 

GATATCCTTG (SEQ ID NO: 3 23) ; 

TATCCTTGAT (SEQ ID NO: 325) ; 

TCCTTGATCT (SEQ ID NO: 327) ; 

CTTGATCTGT (SEQ ID NO: 329) ; 

TGATCTGTGG (SEQ ID NO:331) ; 

ATCTGTGGAT (SEQ ID NO: 3 33) ; 

CT6TGGATCT (SEQ ID NO: 335) ; 
GTGGATCTAC (SEQ ID NO: 337) ; 
GGATCTACCA (SEQ ID NO:339); 
ATCTACCACA (SEQ ID NO: 341) ; 
CTACCACACA (SEQ ID NO: 343); 
ACCACACACA (SEQ ID NO:345); 
CACACACAAG (SEQ ID NO: 347) ; 
CACACAAG6C (SEQ ID NO: 349) ; 
CACAAGGCTA (SEQ ID NO: 351) ; 
CAAGGCTACT (SEQ ID NO: 353) ; 
AGGCTACTTC (SEQ ID NO:355) ; 
GCTACTTCCC (SEQ ID NO: 357) ; 
TACTTCCCTG (SEQ ID NO: 359) ; 
CTTCCCTGAT (SEQ ID NO:361) ; 
TCCCTGATTG (SEQ ID NO: 363) ; 
CCTQATTGGC (SEQ ID NO: 365) ; 
TGATTGGCAG (SEQ ID NO: 3 67) ; 
ATTGGCAGAA (SEQ ID NO:369) ; 
TGGCAGAACT (SEQ ID NO: 371) ; 
GCAGAACTAC (SEQ ID NO: 373) ; 
AGAACTACAC (SEQ ID NO: 375) ; 
.AACTACACAC (SEQ ID NO: 377) ; 
CTACACACCA (SEQ ID NO: 379) ; 
ACACACCAGG (SEQ ID NO: 381) ; 
ACACCAGGGC (SEQ ID NO: 383) ; 
ACCAGGGCCA (SEQ ID NO:385) ; 
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CCAGGGCCAG {SEQ ID NO: 3 86) 
AGGGCCAGGG (SEQ ID NO: 388) 
GGCCAGGGGT (SEQ ID NO: 3 90) 
CCAGGGGTCA (SEQ ID NO: 3 92) 
AGGGGTCAGA (SEQ ID NO: 3 94} 
GGGTCAGATA (SEQ ID NO: 396) 
GTCAGATATC (SEQ ID NO: 3 98) 
CAGATATCCA (SEQ ID NO: 4 00) 
GATATCCACT (SEQ ID NO.-402) , 
TATCCACTGA (SEQ ID NO: 404) 
TCCACTGACC (SEQ ID NO: 4 06) j 
CACTGACCTT (SEQ ID NO:408) , 
CTGACCTTTG (SEQ ID NO:410) j 

GACCTTTGGA (SEQ ID NO: 4 12) ; 

CCTTTGGATG (SEQ ID NO: 414) j 

TTTGGATGGT (SEQ ID NO: 4 16) ; 

TGGATGGTGC (SEQ ID NO: 418) ) 

GATGGTGCTA (SEQ ID NO: 420) ; 

TGGTGCTACA (SEQ ID NO: 422) ; 

GTQCTACAA6 (SEQ ID NO: 4 24) ; 

GCTACAAGCT (SEQ ID NO: 426) ; 

TACAAGCTAG (SEQ ID NO:428) ; 

CAAGCTAGTA (SEQ ID NO:430) ; 

AGCTAGTACC (SEQ ID NO: 432) ; 

CTAGTACCAG (SEQ ID NO: 434) ; 

AGTACCAGTT (SEQ ID NO: 43 6) ; 

TACCAGTTGA (SEQ ID NO:438) ; 

CCAGTTGAGC (SEQ ID NO:440) ; 

AGTTGAGCCA (SEQ ID NO: 442) ; 

TTGAGCCAGA (SEQ ID NO: 444) ; 

GAGCCAGATA (SEQ ID N0:446) ; 

GCCAGATAAG (SEQ ID NO: 44 8) ; 

CAGATAAGGT (SEQ ID NO: 450) ; 

GATAAGGTAG (SEQ ID NO: 452) ; 

TAAGGTAGAA (SEQ ID NO: 454) ; 

AGGTAGAAGA (SEQ ID NO:456) ; 

GTAGAAGAGG (SEQ ID NO: 458) ; 

AGAAGAGGCC (SEQ ID NO:460) ; 
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245 - 

CAGGGCCAGG (SEQ ID NO: 387) 
GGGCCAGGGG (SEQ ID NO:389) 
GCCAGGGGTC (SEQ ID NO: 391); 
CAGGGGTCAG (SEQ ID NO: 393) ; 
GGGGTCAGAT (SEQ ID NO: 3 95) j 
GGTCAGATAT (SEQ ID NO: 3 97) ; 
TCAGATATCC (SEQ ID NO: 399) ; 

AGATATCCAC (SEQ ID NO: 401) j 

ATATCCACTG (SEQ ID NO: 403); 

ATCCACTGAC (SEQ ID NO: 405) ; 
CCACTGACCT (SEQ ID NO: 4 07) ; 

ACTGACCTTT (SEQ ID NO: 409) ; 

TGACCTTTGG (SEQ ID NO: 4 11) ; 

ACCTTTGGAT (SEQ ID NO: 413); 

CTTTGGATGG (SEQ ID NO: 4 15) ; 

TTGGATGGTG (SEQ ID NO:417) ; 

GGATGGTGCT (SEQ ID NO:419); 

ATGGTGCTAC (SEQ ID NO: 421); 

GGTGCTACAA (SEQ ID NO: 423) ; 

TGCTACAAGC (SEQ ID NO: 425) ; 

CTACAAGCTA (SEQ ID NO: 427) ; 

ACAAGCTAGT (SEQ ID NO: 429) ; 

AAGCTAGTAC (SEQ ID NO:431); 

GCTAGTACCA (SEQ ID NO:433); 

TAGTACCAGT (SEQ ID NO:43S) ; 

GTACCAGTTG (SEQ ID NO: 437) ; 

ACCAGTTGAG (SEQ ID NO: 439) ; 

CAGTTGAGCC (SEQ ID NO:441); 

GTTGAGCCAG (SEQ ID NO: 443) ; 

TGAGCCAGAT (SEQ ID NO: 445) ; 

AGCCAGATAA (SEQ ID NO: 447) ; 

CCAGATAAGG (SEQ ID NO:449) ; 

AGATAAGGTA (SEQ ID NO:451) ; 
.ATAAGGTAGA (SEQ ID NO: 453) ; 

AAGGTAGAAG (SEQ ID NO:455) ; 

GGTAGAA6AG (SEQ ID NO: 457) ; 

TAGAAGAGGC (SEQ ID NO: 459) ; 

GAAGAGGCCA (SEQ ID NO:461) ; 
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AAGAGGCCAA (SEQ ID NO: 462) ; 

GAGGCCAATA (SEQ ID NO: 464) ; 

GGCCAATAAA (SEQ ID NO: 4 66) ; 

CCAATAAAGG (SEQ ID NO: 468) ; 

AATAAAGGAG (SEQ ID NO: 470) ; 

TAAAGGAGAG (SEQ ID NO:472); 

AAGGAGAGAA (SEQ ID NO: 474} ; 

GGAGAGAACA (SEQ ID NO: 476) ; 

AGAGAACACC (SEQ ID NO: 478) ; 

AGAACACCAG (SEQ ID NO: 480); 

AACACCAGCT (SEQ ID NO: 482); 

CACCAGCTTG (SEQ ID NO: 4 84); 

CCAGCTTGTT (SEQ ID NO:486) ; 
AGCTTGTTAC (SEQ ID NO: 488) ; 

CTTGTTACAC (SEQ ID NO: 490) ; 
TGTTACACCC (SEQ ID NO: 492); 
TTACACCCTG (SEQ ID NO: 494) ; 
ACACCCTGTG (SEQ ID NO: 496) ; 
ACCCTGTGAG (SEQ ID NO:498); 
CCTGTGAGCC (SEQ ID NO: 500) ; 
TGTGAGCCTG (SEQ ID NO: 502) ; 
TGAGCCTGCA (SEQ ID NO: 504) ; 
AGCCTGCATG (SEQ ID NO:506) ; 
CCT0CATG6A (SEQ ID NO: 508) ; 
TGCATGGAAT (SEQ ID NO: 510); 
CATGGAATGG (SEQ ID NO: 512); 
TGGAATGGAT (SEQ ID NO: 514); 
GAATGGATGA (SEQ ID NO: 516) ; 
ATGGATGACC (SEQ ID NO: 518) ; 
GGATGACCCT (SEQ ID NO : 520); 
ATGACCCTGA (SEQ ID NO: 522) ; 
GACCCTGAGA (SEQ ID NO: 524) ; 
CCCT6AGAGA (SEQ ID NO: 526) ; 
CT6AGAGAGA (SEQ ID NO: 528) ; 
GAGAGAGAAG (SEQ ID NO: 530) ; 
GAGAGAAGTG (SEQ ID NO:532) ; 
GAGAAGTGTT (SEQ ID NO: 534) ; 
GAAGTGTTAG (SEQ ID NO: 536); 
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A6AGGCCAAT 


(SEQ 


ID 


NO: 463) 


A6GCCAATAA 


(SEQ 


ID 


NO:465) 


GCCAATAAAG 


(SEQ 


ID 


NO:467) 


CAATAAAGGA 


(SEQ 


ID 


NO: 469) . 


ATAAAGGAGA 


(SEQ 


ID 


NO:471) , 


AAAGGAGAGA 


(SEQ 


ID 


NO:473) , 


AGGAGAGAAC 


(SEQ 


ID 


NO: 475) , 


GAGAGAACAC 


(SEQ 


ID 


NO: 477) , 


GAGAACACCA 


(SEQ 


ID 


NO: 479) ; 


GAACACCAGC 


(SEQ 


ID 


NO:481) ; 


ACACCAGCTT 


(SEQ 


ID 


NO: 483) ; 


ACCAGCTTGT 


(SEQ 


ID 


NO:485) ; 


CAGCTTGTTA 


(SEQ 


ID 


NO:487>; 


GOTTGTTACA 


(SEQ 


ID 


NO:489) ; 


TTGTTACACC 


(SEQ 


ID 


NO: 491); 


GTTACACCCT 


(SEQ 


ID 


NO: 493) ; 


TACACCCTGT 


(SEQ 


ID 


NO:495) ; 


CACCCTGTGA 


(SEQ 


ID 


NO: 497) ; 


CCCTGTGAGC 


(SEQ 


ID 


NO:499) ; 


CTGTQAGCCT 


(SEQ 


ID 


NO: 501) ; 


GTGAGCCTGC 


(SEQ 


ID 


NO: 503) ; 


GAGCCTGCAT 


(SEQ 


ID 


NO: 505) ; 


GCCTGCATGG 


(SEQ 


ID 


NO: 507) ; 


CTGCATGGAA 


(SEQ 


ID 


NO: 509) ; 


GCATGGAATG 


(SEQ 


ID 


NO: 511) ; 


ATGGAATGGA 


(SEQ 


ID 


NO: 513) ; 


GGAATGGATG 


(SEQ 


ID 


NO: 515) ; 


AATGGATGAC 


(SEQ 


ID 


NO: 517) ; 


TGGATGACCC 


(SEQ 


ID 


NO: 519) ; 


GATGACCCTG 


(SEQ 


ID 


NO: 521) ; 


TGACCCTGAG 


(SEQ 


ID 


NO: 523) ; 


ACCCTGAGAG 


(SEQ 


ID 


NO: 525) ; 


CCTGAGAGAG 


(SEQ 


ID 


NO: 527) ; 


TGAOAGAGAA 


(SEQ 


ID 


NO:529); 


AGA6A6AAGT 


(SEQ 


ID 


NO: 531) ; 


AGAGAA6TGT 


(SEQ 


ID 


NO: 533); 


AGAAGTGTTA 


(SEQ 


ID 


NO: 535) ; 


AAGTGTTAGA 


(SEQ 


ID 


NO: 537) ; 
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AGTGTTAGAG 


(SEQ 


ID 


NO: 538 


) ; 


GTGTTAGAOT 


(SEQ ID NO: 539) ; 


TGTTAGAGTG 


(SEQ 


ID 


NO: 540, 


) ; 


GTTAGAGTGG 


(SEQ ID NO: 541) ; 


TTAGAGTGGA 


(SEQ 


ID 


NO: 542! 


i ; 


TAGAGTGGAG 


(SEQ ID NO: 543) ; 


AGAGTGGAGG 


(SEQ 


ID 


NO : 544 ! 


( ; 


GAGTGGAGGT 


(SEQ ID NO: 545) ; 


AGTGGAGGTT 


(SEQ 


ID 


NO: 546] 


; 


GTGGAGGTTT 


(SEQ ID NO: 547) ; 


TGGA66TTT6 


(SEQ 


ID 


NO: 548.] 


; 


GGAGGTTTGA 


(SEQ ID NO: 549) ; 


GAGGTTTGAC 


(SEQ 


ID 


NO: 550] 


; 


AGGTTTGACA 


(SEQ ID NO: 551) ; 


GGTTTGACAG 


(SEQ 


ID 


NO: 552] 


; 


GTTTGACAGC 


(SBQ ID NO: 553) ; 


TTTGACAGCC 


(SEQ 


ID 


NO: 554] 


; 


TTGACAGCCG 


(SEQ ID NO: 555) ; 


TGACAGCCGC 


(SEQ 


ID 


NO: 556) 


; 


GACAGCCGCC 


(SEQ ID NO: 557) ; 


ACAGCCGCCT 


(SEQ 


ID 


NO: 558) 


; 


CAGCOGCCTA 


(SEQ ID NO: 559) ; 


AGCCGCCTAG 


(SEQ 


ID 


NO:560) 


; 


GCCGCCTAGC 


(SEQ ID NO:561) ; 


CCGCCTAGCA 


(SEQ 


ID 


NO: 562) 


; 


CGCCTAGCAT 


(SEQ ID NO: 563) ; 


GCCTAGCATT 


(SEQ 


ID 


NO: 564) 


; 


CCTAGCATTT 


(SEQ ID NO: 565) ; 


CTAGGATTTC 


(SEQ 


ID 


NO: 566) 


; 


TAGCATTTCA 


(SEQ ID NO: 567) ; 


AGCATTTCAT 


(SEQ 


ID 


NO:568) 


; 


GCATTTCATC 


(SEQ ID NO:569) ; 


CATTTCATCA 


(SEQ 


ID 


NO: 570) 


; 


ATTTCATCAC 


(SEQ ID NO: 571} ; 


TTTCATCACO 


(SEQ 


ID 


NO: 572) 


; 


TTCATCACGT 


(SEQ ID NO:573); 


TCATCACGTG 


(SEQ 


ID 


NO: 574) 


* 

r 


CATCACGTGG 


(SEQ ID NO:575); 


ATCACGT66C 


(SEQ 


ID 


NO: 576) 


7 


TCACGTGGCC 


(SEQ ID NO: 577); 


CACGTG6CCC 


(SEQ 


ID 


NO: 578) 


; 


ACGTGGCCCG 


(SEQ ID NO:579) ; 


CGTGGCCCGA 


(SEQ 


ID 


NO: 580) 


# 


GTGGCCCGAG 


(SEQ ID NO:581) ; 


TGGCCCGAGA 


(SEQ 


ID 


NO: 582) 


/ 


GGCCCGAGAG 


(SEQ ID NO: 583) ; 


GCCCGAGAGC 


(SEQ 


ID 


NO j 584) 


s 


CCCGAGAGCT 


(SEQ ID NO:585) ; 


CCGAGAGCTG 


(SEQ 


ID 


NO: 586) 


; 


CGAGAGCTGC 


(SEQ ID NO: 587) / 


GAGAGCTGCA 


(SEQ 


ID 


NO: 588) 


; 


AGAGCTGCAT 


(SEQ ID NO: 589) ; 


GAGCTGCATC 


(SEQ 


ID 


NO: 590) 




AGCTGCATCC 


(SEQ ID NO: 591) ; 


GCTGCATCCG 


(SEQ 


ID 


NO: 592) 


• 

9 


CTGCATCCGG 


(SEQ ID NO: 593) ; 


TGCATCCGGA 


(SEQ 


ID 


NO: 594) 


1 


GCATCCGGAG 


(SEQ ID NO:595) ; 


CATCCGGAGT 


(SEQ 


ID 


NO: 596) 


; 


ATCCGGAGTA 


(SBQ ID NO: 597) ; 


TCCGGAGTAC 


(SEQ 


ID 


NO: 598) 


; 


CCGGAGTACT 


(SBQ ID NO: 599) ; 


CGGAGTACTT 


(SEQ 


ID 


NO: 600) 


• 


GGAGTACTTC 


(SEQ ID NO: 6 01) ; 


GAGTACTTCA 


(SEQ 


ID 


NO: 602) 


; 


AGTACTTCAA 


(SEQ ID NO: 603) ; 


GTACTTCAAG 


(SEQ 


ID 


NO: 604) 




TACTTCAAGA 


(SBQ ID NO: 605) ; 


ACTTCAAGAA 


(SEQ 


ID 


NO: 606} 


; 


CTTCAAGAAC 


(SEQ ID NO: 6 07) ; 


TTCAAGAACT 


(SEQ 


ID 


NO.-60B) 


* 

9 


TCAAGAACTG 


(SEQ ID NO:609) ; 


CAAGAACTGC 


(SEQ 


ID 


NO: 610) 


} 


AAGAACTGCT 


(SEQ ID NO: 611) ; 


AGAACTGCTG 


(SEQ 


ID 


NO: 612} 


; 


GAACTGCTGA 


(SEQ ID NO: 613) . 
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92. A method according to claim 86 wherein said deletion encompasses at least one 
of the following decanucleotides from the LTR region of HTV-1 strain NL4-3: 



QCTTTTTGCC 

TTTTTGCCTG 

TTTGCCTGTA 

TGCCTGTACT 

CCTGTACTGG 

T6TACTGG6T 

TACTGGGTCT 

CTGGGTCTCT 

GGGTCTCTCT 

GTCTCTCTGG 

CTCTCTQGTT 

CTCTGGTTAG 

CTGGTTA6AC 

GGTTA6ACCA 

TTAGACCAGA 

AGACCAGATC 

ACCAGATCTG 

CAGATCTGAG 

GATCTGAGCC 

TCTGAGCCTG 

TGAGCCTGGG 

AGCCTGGGAG 

CCTGGGAGCT 

TGGGAGCTCT 

GGAGCTCTCT 

AGCTCTCTGG 

CTCTCTGGCT 

CTCTGGCTAA 

CTGGCTAACT 

GGCTAACTAG 

CTAACTAGGG . 

AACTAGGGAA 

CTAGGGAACC 

AGGGAACCCA 



(SEQ ID NO: 652) ; 

(SEQ ID NO: 654) ; 

{SEQ ID NO: 656) ; 

(SEQ ID NO: 658) ; 

(SEQ ID NO: 660) ; 

(SEQ ID NO: 662) ; 

(SEQ ID NO: 664) ; 

(SEQ ID NO: 666) ; 

(SEQ ID NO: 668) ; 

(SEQ ID NO: 670); 

(SEQ ID NO: 672); 

(SEQ ID NO: 674) ; 

(SEQ ID NO: 676) ; 

(SEQ ID NO: 678) ; 

(SEQ ID NO:680) ; 

(SEQ ID NO: 682) ; 

(SEQ ID NO: 684) ; 

(SEQ ID NO: 686) ; 
(SEQ ID NO: 688) ; 
(SEQ ID NO: 690) ; 
(SEQ ID NO: 692) ; 
(SEQ ID NO: 694) ; 
(SEQ ID NO: 696) ; 
(SEQ ID NO: 698) ; 
(SEQ ID NO: 700); 
(SEQ ID NO: 702) ; 
(SEQ ID NO: 704) ; 
(SEQ ID NO: 706) ; 
(SEQ ID NO: 708) ; 
(SEQ ID NO: 710) ; 
(SEQ ID NO: 712) ; 
(SEQ ID NO: 714) ; 
(SEQ ID NO: 716) ; 
(SEQ ID NO: 718); 



CTTTTTGCCT 

TTTTGCCTGT 

TTGCCTGTAC 

GCCTGTACTG 

CTGTACTGGG 

GTACTGGGTC 

ACTGGGTCTC 

TGGGTCTCTC 

GGTCTCTCTG 

TCTCTCTGGT 

TCTCTGGTTA 

TCTCTGGTTA 

TGGTTAGACC 

GTTAGACCAG 

TAGACCAGAT 

GACCAGATCT 

CCAGATCTGA 

AGATCTGAGC 

ATCTGAGCCT 

CTGAGCCTGG 

GAGCCTGGGA 

GCCTGGGAGC 

CTGGGAGCTC 

GGGAGCTCTC 

GAGCTCTCTG 

GCTCTCTGGC 

TCTCTGGCTA 

TCTGGCTAAC 

TGGCTAACTA 

GCTAACTAGG 

TAACTAGGGA 

ACTAGGGAAC 

TAGGGAACCC 

GGGAACCCAC 



(SEQ ID NO: 653) ; 

(SEQ ID NO: 655) ; 

(SEQ ID NO: 657) ; 

(SEQ ID NO: 659) ; 

(SEQ ID NO: 661) ; 

(SEQ ID NO: 663) ; 

(SEQ ID NO: 665) ; 

(SEQ ID NO:667); 

(SEQ ID NO: 669) ; 

(SEQ ID NO: 671) ; 

(SEQ ID NO: 673) ; 

(SEQ ID NO:675); 
(SEQ ID NO: 677} ; 
(SEQ ID NO: 679) ; 
(SEQ ID NO: 681) ; 
(SEQ ID NO: 683) ; 
(SEQ ID NO: 685) ; 
(SEQ ID NO: 687) ; 
(SEQ ID NO: 689) ; 
(SEQ ID NO: 691) ; 
(SEQ ID NO: 693) ; 
(SEQ ID NO: 695) ; 
(SEQ ID NO: 697) ; 
(SEQ ID NO: 699) ; 
(SEQ ID NO: 701) ; 
(SEQ ID NO: 703) ; 
(SEQ ID NO: 705) ; 
(SEQ ID NO: 707) ; 
(SEQ ID NO:709) ; 
(SEQ ID NO: 711) ; 
(SEQ ID NO: 713) ; 
(SEQ ID NO: 715) ; 
(SEQ ID NO: 717) ; 
(SEQ ID NO: 719) ; 
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GGAACCCACT 


(SEQ 


ID 


NO: 720] 




; GAACCCACTG 


(SEQ 


ID 


NO: 721) 


AACCCACTGC 


(SEQ 


ID 


NO: 722] 




; ACCCACTGCT 


(SEQ 


ID 


NO: 723) , 


CCCACTGCTT 


(SEQ 


ID 


NO: 724] 




. CCACTGCTTA 


(SEQ 


ID 


NO: 725) , 


CACTGCTTAA 


(SEQ 


ID 


NO: 726] 




; ACTGCTTAAG 


(SEQ 


ID 


NO:7'27) , 


CTGCTTAAGC 


(SEQ 


ID 


NO: 728] 




. TGCTTAAGCC 


(SEQ 


ID 


NO: 729) , 


GCTTAAGCCT 


(SEQ 


ID 


NO: 730) 




; CTTAAGCCTC 


(SEQ 


ID 


NO:731) j 


TTAAGCCTCA 


(SEQ 


ID 


NO: 732) 




; TAAGCCTCAA 


(SEQ 


ID 


NO:733) j 


AAGCCTCAAT 


(SEQ 


ID 


NO: 734) 




; AGCCTCAATA 


(SEQ 


ID 


NO: 735) j 


GCCTCAATAA 


(SEQ 


ID 


NO: 736) 




; CCTCAATAAA 


(SEQ 


ID 


NO:737) ; 


CTCAATAAAG 


(SEQ 


ID 


NO: 738) 




; TCAATAAAGC 


(SEQ 


ID 


NO:739) ; 


CAATAAAGCT 


(SEQ 


ID 


NO: 740) 




; AATAAAGCTT 


(SEQ 


ID 


NO: 741) ; 


ATAAAGCTTG 


(SEQ 


ID 


NO: 742) 




• TAAAGCTTGC 


(SEQ 


ID 


NO: 743) ; 


AAAGCTTGCC 


(SEQ 


ID 


NO: 744} 




■ AAGCTTGCCT 


(SEQ 


ID 


NO:745) ; 


AGCTTGCCTT 


(SEQ 


ID 


NO: 746) 


; GCTTGCCTTG 


(SEQ 


ID 


NO: 747) ; 


CTTGCCTTGA 


(SEQ 


ID 


NO: 748) 


; TTGCCTTGAG 


(SEQ 


ID 


NO: 749) ; 


TGCCTTGAGT 


(SEQ 


ID 


NO: 750) 


i 


• GCCTTGAGTG 


(SEQ 


ID 


NO: 751) ; 


CCTTGAGTGC 


(SEQ 


ID 


NO: 752) 


I 


CTTGAGTGCT 


(SEQ 


ID 


NO: 753) ; 


TTGAGTGCTT 


(SEQ 


ID 


NO: 754) 


i 


TGAGTGCTTC 


(SEQ 


ID 


NO: 755) ; 


GAGTGCTTCA 


(SEQ 


ID 


NO: 756) 


; 


AGTGCTTCAA 


(SEQ 


ID 


NO: 757) ; 


GTGCTTCAAG 


(SEQ 


ID 


NO: 758) 


; 


TGCTTCAAGT 


(SEQ 


ID 


NO: 759) ; 


GCTTCAAGTA 


(SEQ 


ID 


NO: 760) 




CTTCAAGTAG 


(SEQ 


ID 


NO: 761) ; 


TTCAAGTAGT 


(SEQ 


ID 


NO: 762) 




TCAAGTAGTG 


(SEQ 


ID 


NO: 763) ; 


CAAGTAGTGT 


(SEQ 


ID 


NO: 764) 




AAGTAGTGTG 


(SEQ 


ID 


NO: 765) ; 


AGTAGTGTGT 


(SEQ 


ID 


NO: 766) 




GTAGTGTGTG 


(SEQ 


ID 


NO: 767) ; 


TAGTGTGTGC 


(SEQ 


ID 


NO: 768) 




AGTGTGTGCC 


(SEQ 


ID 


NO: 769) ; 


GTGTGTGCCC 


(SEQ 


ID 


NO:770) 


; 


TGTGTGCCCG 


(SEQ 


ID 


NO: 771) ; 


GTGTGCCCGT 


(SEQ 


ID 


NO: 772) 


9 


TGTGCCCGTC 


(SEQ 


ID 


NO: 773) ; 


GTGCCCGTCT 


(SEQ 


ID 


NO: 774) 


t 


TGCCOGTCTG 


(SEQ 


ID 


NO: 775) ; 


GCCCGTCTGT 


(SEQ 


ID 


NO: 776) 


i 


CCOGTCTGTT 


(SEQ 


ID 


NO: 777) ; 


CCGTCTGTTG 


(SEQ 


ID 


NO: 778) 


m 

r 


CGTCTGTTGT 


(SEQ 


ID 


NO: 779) ; 


GTCTGTTGTG 


(SEQ 


ID 


NO: 780) 


/ 


TCTGTTGTGT 


(SEQ 


ID 


NO: 781) ; 


CTGTTGTGTG 


(SEQ 


ID 


NO: 782) 


; 


TGTTGTGTGA 


(SEQ 


ID 


NO:783) ; 


GTTGTGTGAC 


(SEQ 


ID 


NO: 784) 


; 


TTGTGTGACT 


(SEQ 


ID 


NO: 785) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 786) 




GTGTGACTCT 


(SEQ 


ID 


NO: 787) ; 


TGTGTGACTC 


(SEQ 


ID 


NO: 788) 


/ 


GTGTGACTCT 


(SEQ 


ID 


NO: 789) ; 


TGTGACTCTG 


(SEQ 


ID 


NO: 790) 




GTGACTCTGG 


(SEQ 


ID 


NO: 791) ; 
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TGACTCTGGT (SEQ ID 

ACTCTGGTAA (SEQ ID 

TCTGGTAACT (SEQ ID 

TGGTAACTAG (SEQ ID 



NO: 792) ; GACTCTGGTA 

NO: 794) ; CTCTG6TAAC 

NO: 796) ; CTGGTAACTA 

NO: 798) ; GGTAACTAGA 



(SEQ ID NO: 793) ; 

(SEQ ID NO: 795) ; 

(SEQ ID NO i 797}; 

(SEQ ID NO: 799) . 
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TGGAAGGGCTAATTCACTCACGGAAAAGACCAGTTGAACCAG 

AAGAAGATAGAAGAGGCCOTGAAGAAGAAAACAACAGATTGT 

TCTGCTTGCTCAGCTGGGGACTTTCC^GAAGGCGCGGCCTGA 

GTGACTAAGCCCCGTTGGGGACTTTCCGAAGAGGCATGAAGG 

GACTTTCCAAGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTG 

GCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTCTGCCT 

GTACTGGGTCTCTCTGGTTAGACCAGATCTGAGCCTGGGAGC 

TCTCTGGCTAGCTAGGGAACCCACTGCTTAAGCCTCAATAAA 

GCTTGCCTTGAGTGCTTCAAGTAGTGTGTGCCCGTCTGTTGT 

GTGACTCTGGTATCTAGAGATCCCTCAGACCATTTTAGTCCG 

TGTGGAAAATCTCTAGCAGTGGCGCCCGAACAGGGACTTGAA 

AGCGAAAGGAAAACCAGAGGAGCTCTCTCGACGCAGGACTCG 

GCTTGCTGAAGCGCGCACGGCAAGAGGCGAGGGGCGGCGACT 

GGTGAGTACGCCGAAAATTTTGACTAGCGGAGGCTAGAAGGA 

GAGAGATGGGTGCGAGAGCGTCAATATTAAGCGGGGGAAAAT 

TAGATAGATGGGAGAAAATTCGGTTAAGGCCAGGAGGAAAGA 

AAAAGTATAAATTAAAACATATAGTATGGGCAAGCAGGGAGC 

TAGAACGATTCGCAGTCAATCCTGGCCTGTTGGAAACATCAG 

AAGGCTGTAGACAAATACTGGGACAGTTACACCCGTCCCTTC 

AGACAGGATCAGAAGAACTTAAATCAGTATATAATGCAGTAG 

CAGTCCTCTATTGTGTGCATCAAAACATAGACATAAAGGACA 

CCAAGGAAGCTTTAGAAAAGATAGAGGAAGAGCAAAACAAAT 

GTAAGAAAAAAGCACAGGAAGCAGCAGCACAGCAAGCAGCAG 

CTGGCACAGGAAACAGCAACCCGGTCAGCCAAAATTACCCTA 

TAGTACAGAACATGCAGGGGCAAATGGTACATCAGGCCATAT 

CACCTAGAACTTTAAATGCATGGGTAAAAGTAATAGAAGAGA 

AGGCTTTCAGCCCAGAGGTAATACCCATGTTTTCAGCATTAT 

CAGAAGGAGC CAC CCCACAAGATTTAAACAC CATGCTAAACA 

CAGTGGGGGGACATCAAGCAGCTATGCAAATGTTAAAAGAGA 
CCATCAATGAGGAAGCTGCAGAATGGGATAGATTACATCCAG 
CGCAGGCAGGGCCTGTTGCACCAGGCCAGATGAGAGACCCAA 
GGGGAAGTGACATAGCAGGAACTACTAGTACC CTTCAGGAAC 
AAATAGGATGGATGACAGGTAATC CAGCTATCC CAGTAGGAG 
AAAT CTATAAAAGATGGATAATCCTGGGATTAAATAAAATAG 
TAAGGATGTATAGCCCTATCAGCATTCTGGACATAAAACAAG 
GACCAAAGGAACCCTTTAGAGACTATGTAGACCGGTTGTATA 
AAACTCTAAGAGCCGAGCAAGCTACACAGGAGGTAAAAAATT 
GGATGACAGAAACCTTGTTGGTCCAAAATGCAAACCCAGATT 
GTAAGACTATTTTAAAAGCATTGGGACCAGCAGCTACACTAG 
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AAGAAATGATGACAGCATGTCAGGGAGTGGGAGGACCCAGCC 

ATAAAGCAAGAGTTTTGGCAGAAGCAATGAGCCAAGCAACAA 

ATGCAGCTACTGTAATGATGCAGAGAAGCAATTTTAGAAACC 

AAAGAAAGAATGTTAAGTGTTTCAATTGTGG CAAAGAAGGGC 

ACATAGCCAGAAATTGCAGGGCTCCTAGGAAAAGGGGCTGTT 

GGAAATGTGGAAAGGAAGGACACCAAATGAAAGATTGTACTG 

AGAGACAGGCTAATTTTTTAGGGAAAATCTGGCCTTCCCACA 

AGGGGAGGCCAGGGAACTTTCTTCAGAGCAGGCCAGAACCAA 

CAGCCCCTCTCCAGGGCAGGCCGGAGCCATCAGCCCCGCCAG 

AAGAGAGCTTCAGGTTTGGGGAGGAGACAACAACTCCCTCTC 

AGAAGCAGGAGCCGATAGACAGGGACAGGGATCTGTATCCTT 

TAGCTTCCCTCAGATCACTCTTTGGCAACGACCCCTCGTCAC 

AATAAAGATAGGGGGGCAGCTGAAGGAAGCTCTATTAGATAC 

AGGAGCAGATGATACAGTATTAGAAGACATGCATTTGCCAGG 

AAAATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTAT 

CAAAGTAAAACAATATGATGAAATTCTTGTAGAAATCTGTGG 

ACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCTGT 

CAACATAATTGGAAGAAATCTGTTGACTCAGATTGGTTGCAC 

TTTAAATTTTCCCATTAGTCCTATTGAAACTGTACCAGTACA 

ATTAAAGCCAGGAATGGATGGCCCAAAGGTTAAACAATGGCC 

ATTGACAGAAGAGAAAATAAAAGCATTAGTAGAAATTTGTAC 

AGAAATGGAAAAGGAAGGAAAGATTTCAAAAATTGGGCCTGA 

AAATC CATACAATACTCCAGTATTTGCCATAAAGAAAAAAGA 

TGGTACTAAATGGAGAAAATTAGTAGATTTCAGAGAC CTTAA 

TAAGAGAACTCAAGACTTCTGGGAAGTTCAATTAGGAATACC 

ACATCCCTCAGGATTAAAAAAGAAAAAATCAGTAACAGTACT 

GGATGTGGGTGATGCATACTTTTCAGTTCCCTTAGATGAAAA 

CTTCAGGAAGTATACTGCATTTACCATACCTAGTATAAATAA 

T GAGACACCAGGGATTAGATAT CAGTACAATGTGCTTCCACA 

GGGATGGAAAGGAT CACCAGCAATATTC CAAAGTAGCATGAC 

AAGAATCTTAGAGCCTTTTAGAAGACAAAATCCAGACATAGT 

TATCTATCAATACATGGATGACTTGTATGTAGGATCTGATTT 

AGAAATAGGACAGCATAGAATAAAAATAGAGGAACTGAGACA 

ACATCTGTTGAAGTGGGGATTTACCACACCAGACAAAAAGCA 

TCAGAAAGAACCCCCATTCCTTTGGATGGGTTATGAACTCCA 

TCCTGATAAATGGACAGTGCAACCTATAGTACTGCCAGAAAA 

AGACAGCTGGACTGTCAATGACATACAGAAGTTAGTGGGTAA 

ATTAAATTGGGCAAGTCAGATTTACCCAGGAATTAAAGTAAG 

GCAATTATGTAAACTCCTTAGGGGAACCAAAGCACTAACAGA 
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AGTAATAC CACTAACAGAAGAAG CAGAGCTAGAACTGG CAGA 

AAACAGGGAAATT CTAAGAGAAC CAGTACATGGAGTGTATTA 

T GAC C CATCAAAAGACTTAATAGCAGAAATACAGAAGCAGGA 

GCAAGGCCAATGGACATATCAAATTTATCAAGATCAATTTAA 

AAATCTAAAAACAGGAAAGTATGCAAGATTGAGGGGTGCCCA 

CACTAATGATGTAAAACAATTTCCAGAGGCAGTGCAAAAAAT 

AGCCACAGAAAGCATAGTAATATGGGGAAAGACTCCTAAATT 

TAGACTACCCATACAAAAAGAAACATGGGACGCATGGTGGAC 

AGAGTATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGT 

CAATACCCGTCCCCTAGTAAAATTATGGTACCAGTTAGAAAA 

AGAACCCATAATAGGAGCAGAAACTTTCTATGTAGATGGGGC 

AGCTAACAGAGAGACTAAATTAGGAAAAGCAGGATATGTTAC 

TGACAGAGGAAGACAAAAAGTTGTCTCCCTAACTGACACAAC 

AAATCAGAAGACTGAGTTACAAGCAATTCATCTAGCTTTGCA 

GGATTCAGGATTAGAAGTAAACATAGTAACAGACTCACAGTA 

TG CATTAGGAATCATT CAAGCACAAC CAGATAAAAGTGAAT C 

AGAAATAGTCAATCAAATAATAGAGCAATTAATAAAAAAGGA 

AAAGGTCTACCTGGCATGGGTACCAGCACACAAAGGAATTGG 

AGGGAATGAACAAGTAGATAAATTAGTCAGTGCTGGAATCAG 

GAAAATACTATTTTTAGATGGAATAGATAAGGCACAAGAAGG 

CCATGAGAAATATCAGAGTAATTGGAGAGCAATGGCTAGTGG 

TTTTAACCTGCCACCTATAGTAGCAAAAGAAATAGTAGCCAG 

CTGTGATAAATGTCAGCTAAAAGGAGAAGCCATGCATGGACA 

AGTAGACTGTAGTCCAGGAATATGGCAACTAGATTGTACACA 

TCTAGAAGGAAAAATTATCCTGGTAGCAGTTCATGTAGCCAG 

TGGATATATAGAAGCAGAAGTTATTCCAGCAGAGACAGGGCA 

GGAAACAGCATACTTTATCTTAAAATTAGCAGGAAGGTGGCC 

AGTAAACACAATACATACAGACAATGGCGGCAATTTCATCAG 

TACCACGGTTAAGGCCGCCTGTTGGTGGGCAGGGATCAAGCA 

GGAATTTGGCATTCCCTACAATCCCCAAAGCCAAGGAGTAGT 

GGAAT CTATGAATAGAGAATTAAAGAAAATTATAGGACAGGT 

AAGAGATCAGG CTGAACAT CTTAAGACAGCAGTACAAATGGC 

AGTATTCATCCACAATTTTAAAAGAAAAGGGGGGATTGGGGG 

ATACAGTGCAGGGGAAAGAATAGTAGACATAATAGCAACAGA 

CATACAAACTAAAGAATTACAAAAGCAAATTACAAAAATTCA 

AAATTTTCGGGTTTATTACAGGGACAGCAGAGATCCACTTTG 

GAAAGGACCAGCAAAACTTCTCTGGAAAGGCGAAGGGGCAGT 

AGTAATACAAGATAATAGTGACATAAAAGTAGTGCCAAGAAG 

AAAAGTAAAGATCATTAGGGATTATGGAAAACAGATGGCAGG 
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TGATGATTGTGTGGCAAGTAGACAGGATGAGGATTAGAACAT 
GGAACAGTTTAGTGAAACACCATATGTATGTTTCAAAGAAAG 
CTAAGGGATGGATTTATAGACATCACTATGAAAACACTCATC 
CAAAAATAAGCTCAGAAGTACACATCCCACTAGGGGAAGCTA 
GATTGGTAATAACAACATATTGGGGTCTACATACAGGAGAAA 
GAGACTGGCATTTGGGTCAGGGAGTCTCCATAGAATGGAGGG 
AAAGGACATATAGAACACAAGTAGACCCCGAACTAGCAGACC 
AACTAATTCATATACATTACTTTGATTGTTTTTCAGAATCTG 
CCATAAGAAGTGCCATATTAGGATATAGAGTTAGGCATAGGT 
GTGAATATCAAGCAGGACATAACAAGGTAGGATCT CTACAGT 
ACTTGGCACTAACAGCATTAATAACACCAAAGAAGATAAAGC 
CACCTTTGCCTAGTGTTGCGAAACTGACAGAGGATAGATGGA 
ACAAGCCCCAGAAGACCAAGGGCCACAGAGG.CAGCCATACAA 
TGAATGGACACTAGAACTTTTAGAGGAGCTTAAGAATGAAGC 
TGTTAGGCATTTTCCTAGGGTATGGCTCCATGGCTTAGGGCA 
ACATATCTATGAAACTTATGGGGATACTTGGGAAGGAGTGGA 
GGCCATAACAAGAACTCTGCAACAACTGCTGTTTATTCATTT 
CAGAATTGGGTGTCAACATAGCAGAATAGGCATTATTCGACA 
GAGGAGAGCAAGAAATGGAGCCAGTAGATCCTAGACTAGAGC 
CCTGGAAGCATCCAGGAAGTCAGCCTAAGACTGCGTGTACCA 
CTTGCTATTGTAAAAAGTGCTGCTTTCATTGCCAAGTTTGTT 
TTATGACAAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGC 
GGAGACAGCGACGAAGAGCTCCTCAAGACAGTCAGACTCATC 
AAGCTTATCTATCAAAGCAGTAAGTAATATATGTAATGCAAC 
CTTTACAAATAGTAGCAATAGTAGCATTAGTAGTAGCAGGAA 
TAATAGCAATAGTTGTGTGGACCATAGTATTCATAGAATATA 
AGAAAATATTAAGACAAAGAAAAATAGACAGGTTGATTGATA 
GAATAAGAGAAAGAGCAGAAGACAGTGGCAATGACAGTGAAG 
GGGATCAGGAAGAATTATCGGCACTTGTGGACATGGGGCACC 
ATGATCCTTGGGATATTAATGATCTGTAGAGCTGCAAACAAT 
TTGTGGGTCACAGTCTATTATGGGGTACCTGTGTGGAGAGAA 
GCAACCACCACTCTATTTTGTGCATCAGATGCCAAGGCATAT 
GATGCAGAGGTACATAATGTTTGGGCCACACATGCCTGTGTA 
CCCACAGACCCTAACCCACAAGAAGTAGAATTGAAAAATGTG 
ACAGAAAATTTTAACATGTGGAAAAATAACATGGTAGAACAG 
ATGCATGAGGATATAATCAGTTTATGGGATCAAAGCCTGAAG 
CCATGTGTAAAATTAACCCCACTCTGTGTTTCTTTAAATTGC 
ACTGATGCTACTAATACCACTAATAGTAATACCACTAGCAGC 
AGCGAGAAACCGAAGGGGACAGGGGAAATAAAAAACTGCTCT 
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TTCAATATCACCACAAGCATAAGAGATAAGGTGCAGAAACAA 

TATGCACTTTTTTATAGCCTTGATGTAGTACCAATGGATGAT 

AATGATAATAGTACAAGCTATAGGTTAATAAGTTGTAACACC 

TCAATCATTACA.CAGGCCTGTCCAAAGATATCCTTTGAGCCA 

ATTCCCATACATTATTGTGCCCCGGCTGGTTTTGCGATTCTA 

AAGTGTAAAGATAAAAGGTTCAATGGAAAAGGACCATGTACA 

AGTGTCAGCACAGTACAGTGTACACATGGAATTAGGCCAGTA 

GTATCAACTCAACTGTTGTTAAATGGCAGTCTAGCAGAAGAA 

GAGGTAGTAATTAGATCTGACAATTTTACGAACAATGCTAAA 

ACCATAATAGTACAGCTGAGCAAATCTGTAGAAATTACTTGT 

GTAAGACCCAACAACAATACAAGAAAAAGTATAAGTATGGGA 

C CAGGGAGAG CATTTTATACAACAGAAATAATAGGAGATATA 

AGACAAGCATATTGTAACATTAGTAAAGCAAACTGGACTGAC 

ACTTTAGAACAGATAGCTAGAAAATTAAGAGAACAATTTGAG 

AATAAAACAATAGTCTTTAAGCCATCCTCAGGAGGGGACCCA 

GAAATTGTAACACAGTTTTACAGTTTTAATTGTGGAGGGGAA 

TTTTTCTACTGTAATTCAACACAACTGTTTAATGGTACTTGG 

AATGGTACTTGGGTTAATGGTACTTGGAGTAGTAATAATACG 

ACTGATACTGCAAATATCACACTCCCATGCAGAATAAAACAA 

TTTATAAACATGTGGCAGGAAGTAGGAAAAGCAATGTATGCC 

CCTCCCATCAAAGGACAAATTAAATGTACATCAAATATTACA 

GGGCTGATATTAACAAGAGATGGTGGTAACAATAACAC CACG 

AACGACAACGAGACCGAGACCTTCAGACCTGGAGGAGGAGAT 

ATGAGGGACAATTGGAGAAGTGAATTATATAAATATAAAGTA 

GTACAAGTTGAAC CATTAGGAGTAGCACCCACCAAGGCAAAG 

AGAAGAGTGGTGCAAAGAGAAAAAAGAGCAGTGGGAATAGGA 

GCTATGTTCCTTGGGTTCTTAGGAGCAGCAGGAAGCACTATG 

GGCGCAGCGTCAGTGACGCTGACGGTACAAGCCAGACAATTA 

TTGTCTGGTATAGTGCAGCAGCAGAACAATCTGCTGAGGGCT 

ATTGAGGCGCAACAGC^TCTGTTGCAACTCACAGTCT GGGGC 

ATCAAACAGCTCCAGGCAAGAGTCCTGGCTGTGGAAAGATAC 

CTAAGGGATCAACAGCTCCTGGGACTTTGGGGTTGCTCTGGA 

AAAOTCATTTGCACCACTACTGTGCCTTGGAACAATAGCTGG 

AGTAATAAATCTCTGGAAACAATTTGGGATAACATGACCTGG 

ATGCAGTGGGAAAGAGAAATTGACAATTACACAAACATAATA 

TACACCTTAATTGAAGAATCGCAGAACCAACAAGAAAAAAAT 

GAACTAGAATTATTGGAATTGGATAAATGGGCAAATTTGTGG 

AATTGGTTTAGTATATCAAACTGGCTATGGTATATAAAATTA 

TTCATAATGGTAGTAGGAGGCTTGGTAGGTTTAAGAATAGTT 
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TTTACTGTACTTTCTATAGTTAATAGAGTTAGGCAGGGATAC 
TCACCATTATCGTTTCAGACCCACCTCCCAACCCCGAAGGGA 
CCCGACAGGCCAGAAGGAATCGAAGAAGAAGGTGGAGAGAGA 
GACAGAGGCAGCTCCACTCGATTAGTGCACGGATTCTTAGCA 
CTTTTCTGGGACGACCTGAGGAGTCTGTGCCTCTTCAGCTAC 
CACCACTTGAGAGACTTACTCTTGATTGTAACGAGGATTGTG 
GAACTTCTGGGACGCAGGGGATGGGAAGCCCTCAAATACTGG 
TGGAATCTCCTGCAGTATTGGAGGCAGGAACTACAGAAGAGT 
GCTGTTAGCTTGTTCAATGGCACGGCCATAGCAGTAGCTGAG 
GGGACAGATAGAGTTATAGAAGCTTTACGAAGGGCTTATAGA 
GCTATTCTCCACATACCTAGAAGAATAAGACAGGGCTTAGAA 
AGGGCTTTGCTATAAAATGGGTGGCAAGTGGTCAGAAAGTAG 
TGTGGTTAGAAGGCATGTACCTTTAAGACAAGGCAGCTATAG 
ATCTTAGCCGCTTTTTAAAAGAAAAGGGGGGACTGGAAGGGC 
TAATTCACTCACGGAAAAGACCAGTTGAACCAGAAGAAGATA 
GAAGAGGCCATGAAGAAGAAAACAACAGATTGTTCTGCTTGC 
TCAGCTGGGGACTTTCCAGAAGGCGCGGCCTGAGTGACTAAG 
C CCCGTTGGGGACTTTCCGAAGAGGCATGAAGGGACTTTCCA 
AGGCAGGCGTGGCCTGGGCGGGACTGGGGAGTGGCGAGCCCT 
CAGATGCTGCATATAAGCAGCTGCTTTCTGCCTGTACTGGGT 
CTCTCTGGTTAGACCAGATCTGAGCCTGGGAGCTCTCTGGCT 
AGCTAGGGAACCCACTGCTTAAGCCTCAATAAAGCTTGCCTT 
GAGTGCTTCAAGTAGTGTGTGCCCGTC^GTTGTGTGACTCTG 
GTATCTAGAGATCCCTCAGACCATTTTAGTCCGTGTGGAAAA 
TCTCTAGCA 
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